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Preface 


The goal of this textbook is twofold. First, the book serves as an introduction 
to the field of parameterized algorithms and complexity accessible to graduate 
students and advanced undergraduate students. Second, it contains a clean 
and coherent account of some of the most recent tools and techniques in the 
area. 

Parameterized algorithmics analyzes running time in finer detail than clas- 
sical complexity theory: instead of expressing the running time as a function 
of the input size only, dependence on one or more parameters of the input in- 
stance is taken into account. While there were examples of nontrivial param- 
eterized algorithms in the literature, such as Lenstra’s algorithm for integer 
linear programming or the disjoint paths algorithm of Robertson and 
Seymour [402], it was only in the late 1980s that Downey and Fellows [149], 
building on joint work with Langston [L83], proposed the system- 
atic exploration of parameterized algorithms. Downey and Fellows laid the 
foundations of a fruitful and deep theory, suitable for reasoning about the 
complexity of parameterized algorithms. Their early work demonstrated that 
fixed-parameter tractability is a ubiquitous phenomenon, naturally arising 
in various contexts and applications. The parameterized view on algorithms 
has led to a theory that is both mathematically beautiful and practically ap- 
plicable. During the 30 years of its existence, the area has transformed into 
a mainstream topic of theoretical computer science. A great number of new 
results have been achieved, a wide array of techniques have been created, 
and several open problems have been solved. At the time of writing, Google 
Scholar gives more than 4000 papers containing the term “fixed-parameter 
tractable”. While a full overview of the field in a single volume is no longer 
possible, our goal is to present a selection of topics at the core of the field, 
providing a key for understanding the developments in the area. 
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Why This Book? 


The idea of writing this book arose after we decided to organize a summer 
school on parameterized algorithms and complexity in Bedlewo in August 
2014. While planning the school, we realized that there is no textbook that 
contains the material that we wanted to cover. The classical book of Downey 
and Fellows summarizes the state of the field as of 1999. This book 
was the starting point of a new wave of research in the area, which is ob- 
viously not covered by this classical text. The area has been developing at 
such a fast rate that even the two books that appeared in 2006, by Flum and 
Grohe and Niedermeier [376], do not contain some of the new tools and 
techniques that we feel need to be taught in a modern introductory course. 
Examples include the lower bound techniques developed for kernelization in 
2008, methods introduced for faster dynamic programming on tree decompo- 
sitions (starting with Cut & Count in 2011), and the use of algebraic tools for 
problems such as LONGEST PATH. The book of Flum and Grohe focuses 
to a large extent on complexity aspects of parameterized algorithmics from 
the viewpoint of logic, while the material we wanted to cover in the school is 
primarily algorithmic, viewing complexity as a tool for proving that certain 
kinds of algorithms do not exist. The book of Niedermeier gives a gen- 
tle introduction to the field and some of the basic algorithmic techniques. In 
2013, Downey and Fellows published the second edition of their clas- 
sical text, capturing the development of the field from its nascent stages to 
the most recent results. However, the book does not treat in detail many 
of the algorithmic results we wanted to teach, such as how one can apply 
important separators for EDGE MULTIWAY CUT and DIRECTED FEEDBACK 
VERTEX SET, linear programming for ALMOST 2-SAT, Cut & Count and 
its deterministic counterparts to obtain faster algorithms on tree decompo- 
sitions, algorithms based on representative families of matroids, kernels for 
FEEDBACK VERTEX SET, and some of the reductions related to the use of 
the Strong Exponential Time Hypothesis. 

Our initial idea was to prepare a collection of lecture notes for the school, 
but we realized soon that a coherent textbook covering all basic topics in 
equal depth would better serve our purposes, as well as the purposes of those 
colleagues who would teach a semester course in the future. We have or- 
ganized the material into chapters according to techniques. Each chapter 
discusses a certain algorithmic paradigm or lower bound methodology. This 
means that the same algorithmic problem may be revisited in more than one 
chapter, demonstrating how different techniques can be applied to it. Thanks 
to the rapid growth of the field, it is now nearly impossible to cover every 
relevant result in a single textbook. Therefore, we had to carefully select what 
to present at the school and include in the book. Our goal was to include a 
self-contained and teachable exposition of what we believe are the basic tech- 
niques of the field, at the expense of giving a complete survey of the area. A 
consequence of this is that we do not always present the strongest result for 
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a particular problem. Nevertheless, we would like to point out that for many 
problems the book actually contains the state of the art and brings the reader 
to the frontiers of research. We made an effort to present full proofs for most 
of the results, where this was feasible within the textbook format. We used 
the opportunity of writing this textbook to revisit some of the results in the 
literature and, using the benefit of hindsight, to present them in a modern 
and didactic way. 


At the end of each chapter we provide sections with exercises, hints to 


exercises and bibliographical notes. Many of the exercises complement the 
main narrative and cover important results which have to be omitted due 
to space constraints. We use (&) and (§%) to identify easy and challenging 
exercises. Following the common practice for textbooks, we try to minimize 
the occurrence of bibliographical and historical references in the main text 
by moving them to bibliographic notes. These notes can also guide the reader 
on to further reading. 


Organization of the Book 


The book is organized into three parts. The first seven chapters give the 
basic toolbox of parameterized algorithms, which, in our opinion, every course 
on the subject should cover. The second part, consisting of Chapters 
covers more advanced algorithmic techniques that are featured prominently 
in current research, such as important separators and algebraic methods. 
The third part introduces the reader to the theory of lower bounds: the 
intractability theory of parameterized complexity, lower bounds based on the 
Exponential Time Hypothesis, and lower bounds on kernels. We adopt a 
very pragmatic viewpoint in these chapters: our goal is to help the algorithm 
designer by providing evidence that certain algorithms are unlikely to exist, 
without entering into complexity theory in deeper detail. Every chapter is 
accompanied by exercises, with hints for most of them. Bibliographic notes 
point to the original publications, as well as to related work. 


Chapter |1| motivates parameterized algorithms and the notion of fixed- 
parameter tractability with some simple examples. Formal definitions of 
the main concepts are introduced. 

Kernelization is the first algorithmic paradigm for fixed-parameter tractabil- 
ity that we discuss. Chapter [2] gives an introduction to this technique. 
Branching and bounded-depth search trees are the topic of Chapter 
We discuss both basic examples and more advanced applications based on 
linear programming relaxations, showing the fixed-parameter tractability 
of, e.g., ODD CYCLE TRANSVERSAL and ALMOST 2-SAT. 

Iterative compression is a very useful technique for deletion problems. 
Chapter |4| introduces the technique through three examples, including 
FEEDBACK VERTEX SET and ODD CYCLE TRANSVERSAL. 
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e Chapter [5|discusses techniques for parameterized algorithms that use ran- 
domization. The classic color coding technique for LONGEST PATH will 
serve as an illustrative example. 

e Chapter [6]presents a collection of techniques that belong to the basic tool- 
box of parameterized algorithms: dynamic programming over subsets, in- 
teger linear programming (ILP), and the use of well-quasi-ordering results 
from graph minors theory. 

e Chapter [7|introduces treewidth, which is a graph measure that has impor- 
tant applications for parameterized algorithms. We discuss how to use dy- 
namic programming and Courcelle’s theorem to solve problems on graphs 
of bounded treewidth and how these algorithms are used more generally, 
for example, in the context of bidimensionality for planar graphs. 

e Chapter|8|presents results that are based on a combinatorial bound on the 
number of so-called “important separators”. We use this bound to show the 
fixed-parameter tractability of problems such as EDGE MULTICUT and DI- 
RECTED FEEDBACK VERTEX SET. We also discuss randomized sampling 
of important cuts. 

e The kernels presented in Chapter [J] form a representative sample of more 
advanced kernelization techniques. They demonstrate how the use of min- 
max results from graph theory, the probabilistic method, and the proper- 
ties of planar graphs can be exploited in kernelization. 

e Two different types of algebraic techniques are discussed in Chapter 
algorithms based on the inclusion—exclusion principle and on polynomial 
identity testing. We use these techniques to present the fastest known 
parameterized algorithms for STEINER TREE and LONGEST PATH. 

e In Chapter [17] we return to dynamic programming algorithms on graphs 
of bounded treewidth. This chapter presents three methods (subset convo- 
lution, Cut & Count, and a rank-based approach) for speeding up dynamic 
programming on tree decompositions. 

e The notion of matroids is a fundamental concept in combinatorics and 
optimization. Recently, matroids have also been used for kernelization and 
parameterized algorithms. Chapter [12] gives a gentle introduction to some 
of these developments. 

e Chapter [13] presents tools that allow us to give evidence that certain prob- 
lems are not fixed-parameter tractable. The chapter introduces parame- 
terized reductions and the W-hierarchy, and gives a sample of hardness 
results for various concrete problems. 

e Chapter[14}uses the (Strong) Exponential Time Hypothesis to give running 
time lower bounds that are more refined than the bounds in Chapter [13] 
In many cases, these stronger complexity assumptions allow us to obtain 
lower bounds essentially matching the best known algorithms. 

e Chapter [15|gives the tools for showing lower bounds for kernelization algo- 
rithms. We use methods of composition and polynomial-parameter trans- 
formations to show that certain problem, such as LONGEST PATH, do not 
admit polynomial kernels. 
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Fig. 0.1: Dependencies between the chapters 


As in any textbook we will assume that the reader is familiar with the 
content of one chapter before moving to the next. On the other hand, for 
most chapters it is not necessary for the reader to have read all preceeding 
chapters. Thus the book does not have to be read linearly from beginning to 
end. Figure [0.1] depicts the dependencies between the different chapters. For 
example, the chapters on Iterative Compression and Bounded Search Trees 
are considered necessary prerequisites to understand the chapter on finding 
cuts and separators. 


Using the Book for Teaching 


A course on parameterized algorithms should cover most of the material in 
Part I, except perhaps the more advanced sections marked with an asterisk. 
In Part II, the instructor may choose which chapters and which sections to 
teach based on his or her preferences. Our suggestion for a coherent set of 
topics from Part II is the following: 
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All of Chapter |8| as it is relatively easily teachable. The sections of this 
chapter are based on each other and hence should be taught in this order, 
except that perhaps Section [8.4]and Sections |8.5}8-6]are interchangeable. 
Chapter[9|contains four independent sections. One could select Section [9.1] 
(FEEDBACK VERTEX SET) and Section[9.3| (CONNECTED VERTEX COVER 
on planar graphs) in a first course. 

From Chapter [10] we suggest presenting Section [10.1] (inclusion—exclusion 
principle), and Section [10.4.1] (Lonarst PATH in time 2® . nO), 

From Chapter [11| we recommend teaching Sections [11.2.1]and as 
they are most illustrative for the recent developments on algorithms on 
tree decompositions. 

From Chapter[12]we recommend teaching Section [12.3] If the students are 
unfamiliar with matroids, Section [12.1] provides a brief introduction to the 
topic. 


Part III gives a self-contained exposition of the lower bound machinery. In 


this part, the sections not marked with an asterisk give a set of topics that 
can form the complexity part of a course on parameterized algorithms. In 
some cases, we have presented multiple reductions showcasing the same kind 
of lower bounds; the instructor can choose from these examples according to 


the needs of the course. Section |14.4.1] contains some more involved proofs, 


but one can give a coherent overview of this section even while omitting most 
of the proofs. 
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A squirrel, a platypus and a hamster walk into a bar... 


Imagine that you are an exceptionally tech-savvy security guard of a bar 
in an undisclosed small town on the west coast of Norway. Every Friday, 
half of the inhabitants of the town go out, and the bar you work at is well 
known for its nightly brawls. This of course results in an excessive amount 
of work for you; having to throw out intoxicated guests is tedious and rather 
unpleasant labor. Thus you decide to take preemptive measures. As the town 
is small, you know everyone in it, and you also know who will be likely to 
fight with whom if they are admitted to the bar. So you wish to plan ahead, 
and only admit people if they will not be fighting with anyone else at the 
bar. At the same time, the management wants to maximize profit and is not 
too happy if you on any given night reject more than k people at the door. 
Thus, you are left with the following optimization problem. You have a list 
of all of the n people who will come to the bar, and for each pair of people 
a prediction of whether or not they will fight if they both are admitted. You 
need to figure out whether it is possible to admit everyone except for at most 
k troublemakers, such that no fight breaks out among the admitted guests. 
Let us call this problem the BAR FIGHT PREVENTION problem. Figure [1-1] 
shows an instance of the problem and a solution for k = 3. One can easily 
check that this instance has no solution with k = 2. 
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Fig. 1.1: An instance of the BAR FIGHT PREVENTION problem with a solution 
for k = 3. An edge between two guests means that they will fight if both are 
admitted 


Efficient algorithms for BAR FIGHT PREVENTION 


Unfortunately, BAR FIGHT PREVENTION is a classic NP-complete problem 
(the reader might have heard of it under the name VERTEX COVER), and so 
the best way to solve the problem is by trying all possibilities, right? If there 
are n = 1000 people planning to come to the bar, then you can quickly code up 
the brute-force solution that tries each of the 21° ~ 1.07-10°°! possibilities. 
Sadly, this program won’t terminate before the guests arrive, probably not 
even before the universe implodes on itself. Luckily, the number k of guests 
that should be rejected is not that large, k < 10. So now the program only 
needs to try (99°) ~ 2.63 - 10?3 possibilities. This is much better, but still 
quite infeasible to do in one day, even with access to supercomputers. 

So should you give up at this point, and resign yourself to throwing guests 
out after the fights break out? Well, at least you can easily identify some 
peaceful souls to accept, and some troublemakers you need to refuse at the 
door for sure. Anyone who does not have a potential conflict with anyone else 
can be safely moved to the list of people to accept. On the other hand, if some 
guy will fight with at least k + 1 other guests you have to reject him — as 
otherwise you will have to reject all of his k+1 opponents, thereby upsetting 
the management. If you identify such a troublemaker (in the example of 
Fig. Daniel is such a troublemaker), you immediately strike him from 
the guest list, and decrease the number k of people you can reject by one|!| 

If there is no one left to strike out in this manner, then we know that each 
guest will fight with at most k other guests. Thus, rejecting any single guest 
will resolve at most k potential conflicts. And so, if there are more than k? 





1 The astute reader may observe that in Fig(1.1| after eliminating Daniel and setting k = 2, 
Fedor still has three opponents, making it possible to eliminate him and set k = 1. Then 
Bob, who is in conflict with Alice and Christos, can be eliminated, resolving all conflicts. 
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potential conflicts, you know that there is no way to ensure a peaceful night 
at the bar by rejecting only k guests at the door. As each guest who has not 
yet been moved to the accept or reject list participates in at least one and at 
most k potential conflicts, and there are at. most k? potential conflicts, there 
are at most 2k? guests whose fate is yet undecided. Trying all possibilities 
for these will need approximately ea < (Ca) zx 2.24-10!° checks, which 
is feasible to do in less than a day on a modern supercomputer, but quite 
hopeless on a laptop. 

If it is safe to admit anyone who does not participate in any potential 
conflict, what about those who participate in exactly one? If Alice has a 
conflict with Bob, but with no one else, then it is always a good idea to admit 
Alice. Indeed, you cannot accept both Alice and Bob, and admitting Alice 
cannot be any worse than admitting Bob: if Bob is in the bar, then Alice has 
to be rejected for sure and potentially some other guests as well. Therefore, 
it is safe to accept Alice, reject Bob, and decrease k by one in this case. This 
way, you can always decide the fate of any guest with only one potential 
conflict. At this point, each guest you have not yet moved to the accept or 
reject list participates in at least two and at most k potential conflicts. It is 
easy to see that with this assumption, having at most k? unresolved conflicts 
implies that there are only at most k? guests whose fate is yet undecided, 
instead of the previous upper bound of 2k?. Trying all possibilities for which 
of those to refuse at the door requires Cc) < ea = 1.73 - 1018 checks. 
With a clever implementation, this takes less than half a day on a laptop, 
so if you start the program in the morning you'll know who to refuse at the 
door by the time the bar opens. Therefore, instead of using brute force to 
go through an enormous search space, we used simple observations to reduce 
the search space to a manageable size. This algorithmic technique, using 
reduction rules to decrease the size of the instance, is called kernelization, 
and will be the subject of Chapter |2| (with some more advanced examples 
appearing in Chapter [9). 

It turns out that a simple observation yields an even faster algorithm for 
BAR FIGHT PREVENTION. The crucial point is that every conflict has to 
be resolved, and that the only way to resolve a conflict is to refuse at least 
one of the two participants. Thus, as long as there is at least one unresolved 
conflict, say between Alice and Bob, we proceed as follows. Try moving Alice 
to the reject list and run the algorithm recursively to check whether the 
remaining conflicts can be resolved by rejecting at most k — 1 guests. If this 
succeeds you already have a solution. If it fails, then move Alice back onto the 
undecided list, move Bob to the reject list and run the algorithm recursively 
to check whether the remaining conflicts can be resolved by rejecting at most 
k — 1 additional guests (see Fig. (1.2). If this recursive call also fails to find 
a solution, then you can be sure that there is no way to avoid a fight by 
rejecting at most k guests. 

What is the running time of this algorithm? All it does is to check whether 
all conflicts have been resolved, and if not, it makes two recursive calls. In 
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Alice vs. Bob 





Fail Fail Fail Fail Fail Fail Fail OK 


Fig. 1.2: The search tree for BAR FIGHT PREVENTION with k = 3. In the 
leaves marked with “Fail”, the parameter k is decreased to zero, but there 
are still unresolved conflicts. The rightmost branch of the search tree finds a 
solution: after rejecting Bob, Daniel, and Fedor, no more conflicts remain 


both of the recursive calls the value of k decreases by 1, and when k reaches 
0 all the algorithm has to do is to check whether there are any unresolved 
conflicts left. Hence there is a total of 2} recursive calls, and it is easy to 
implement each recursive call to run in linear time O(n + m), where m is 
the total number of possible conflicts. Let us recall that we already achieved 
the situation where every undecided guest has at most k conflicts with other 
guests, so m < nk/2. Hence the total number of operations is approximately 
2K. n.k < 21° . 10,000 = 10,240,000, which takes a fraction of a second 
on today’s laptops. Or cell phones, for that matter. You can now make the 
BAR FIGHT PREVENTION app, and celebrate with a root beer. This simple 
algorithm is an example of another algorithmic paradigm: the technique of 
bounded search trees. In Chapter |3| we will see several applications of this 
technique to various problems. 

The algorithm above runs in time O(2" - k- n), while the naive algorithm 
that tries every possible subset of k people to reject runs in time O(n*). 
Observe that if k is considered to be a constant (say k = 10), then both 
algorithms run in polynomial time. However, as we have seen, there is a quite 
dramatic difference between the running times of the two algorithms. The 
reason is that even though the naive algorithm is a polynomial-time algorithm 
for every fixed value of k, the exponent of the polynomial depends on k. 
On the other hand, the final algorithm we designed runs in linear time for 
every fixed value of k! This difference is what parameterized algorithms and 
complexity is all about. In the O(2*-k-n)-time algorithm, the combinatorial 
explosion is restricted to the parameter k: the running time is exponential 
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in k, but depends only polynomially (actually, linearly) on n. Our goal is to 
find algorithms of this form. 


Algorithms with running time f(k). n°, for a constant c independent of 
both n and k, are called fixed-parameter algorithms, or FPT algorithms. 
Typically the goal in parameterized algorithmics is to design FPT al- 
gorithms, trying to make both the f(k) factor and the constant c in 
the bound on the running time as small as possible. FPT algorithms 
can be put in contrast with less efficient XP algorithms (for slice-wise 
polynomial), where the running time is of the form f(k)-n9“), for some 
functions f,g. There is a tremendous difference in the running times 
f(k)-n9™ and f(k)- n°. 


In parameterized algorithmics, k is simply a relevant secondary mea- 
surement that encapsulates some aspect of the input instance, be it the 
size of the solution sought after, or a number describing how “struc- 
tured” the input instance is. 


A negative example: vertex coloring 


Not every choice for what k measures leads to FPT algorithms. Let us have 
a look at an example where it does not. Suppose the management of the 
hypothetical bar you work at doesn’t want to refuse anyone at the door, but 
still doesn’t want any fights. To achieve this, they buy k — 1 more bars across 
the street, and come up with the following brilliant plan. Every night they 
will compile a list of the guests coming, and a list of potential conflicts. Then 
you are to split the guest list into k groups, such that no two guests with 
a potential conflict between them end up in the same group. Then each of 
the groups can be sent to one bar, keeping everyone happy. For example, 
in Fig. we may put Alice and Christos in the first bar, Bob, Erik, and 
Gerhard in the second bar, and Daniel and Fedor in the third bar. 

We model this problem as a graph problem, representing each person as 
a vertex, and each conflict as an edge between two vertices. A partition of 
the guest list into k groups can be represented by a function that assigns 
to each vertex an integer between 1 and k. The objective is to find such a 
function that, for every edge, assigns different numbers to its two endpoints. 
A function that satisfies these constraints is called a proper k-coloring of the 
graph. Not every graph has a proper k-coloring. For example, if there are 
k + 1 vertices with an edge between every pair of them, then each of these 
vertices needs to be assigned a unique integer. Hence such a graph does not 
have a proper k-coloring. This gives rise to a computational problem, called 
VERTEX COLORING. Here we are given as input a graph G and an integer k, 
and we need to decide whether G has a proper k-coloring. 
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It is well known that VERTEX COLORING is NP-complete, so we do not 
hope for a polynomial-time algorithm that works in all cases. However, it is 
fair to assume that the management does not want to own more than k = 5 
bars on the same street, so we will gladly settle for a O(2*-n°)-time algorithm 
for some constant c, mimicking the success we had with our first problem. 
Unfortunately, deciding whether a graph G has a proper 5-coloring is NP- 
complete, so any f(k) - n°-time algorithm for VERTEX COLORING for any 
function f and constant c would imply that P = NP; indeed, suppose such 
an algorithm existed. Then, given a graph G, we can decide whether G has a 
proper 5-coloring in time f(5)-n° = O(n°). But then we have a polynomial- 
time algorithm for an NP-hard problem, implying P = NP. Observe that 
even an XP algorithm with running time f(k)-n9") for any functions f and 
g would imply that P = NP by an identical argument. 


A hard parameterized problem: finding cliques 


The example of VERTEX COLORING illustrates that parameterized algorithms 
are not all-powerful: there are parameterized problems that do not seem to 
admit FPT algorithms. But very importantly, in this specific example, we 
could explain very precisely why we are not able to design efficient algorithms, 
even when the number of bars is small. From the perspective of an algorithm 
designer such insight is very useful; she can now stop wasting time trying to 
design efficient algorithms based only on the fact that the number of bars is 
small, and start searching for other ways to attack the problem instances. If 
we are trying to make a polynomial-time algorithm for a problem and failing, 
it is quite likely that this is because the problem is NP-hard. Is the theory 
of NP-hardness the right tool also for giving negative evidence for fixed- 
parameter tractability? In particular, if we are trying to make an f(k) ns- 
time algorithm and fail to do so, is it because the problem is NP-hard for 
some fixed constant value of k, say k = 100? Let us look at another example 
problem. 

Now that you have a program that helps you decide who to refuse at the 
door and who to admit, you are faced with a different problem. The people in 
the town you live in have friends who might get upset if their friend is refused 
at the door. You are quite skilled at martial arts, and you can handle at most 
k — 1 angry guys coming at you, but probably not k. What you are most 
worried about are groups of at least k people where everyone in the group is 
friends with everyone else. These groups tend to have an “all for one and one 
for all” mentality — if one of them gets mad at you, they all do. Small as the 
town is, you know exactly who is friends with whom, and you want to figure 
out whether there is a group of at least k people where everyone is friends 
with everyone else. You model this as a graph problem where every person 
is a vertex and two vertices are connected by an edge if the corresponding 
persons are friends. What you are looking for is a clique on k vertices, that 
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is, a set of k vertices with an edge between every pair of them. This problem 
is known as the CLIQUE problem. For example, if we interpret now the edges 
of Fig. [LI] as showing friendships between people, then Bob, Christos, and 
Daniel form a clique of size 3. 

There is a simple O(n*)-time algorithm to check whether a clique on at 


least k vertices exists; for each of the (7) = OG) subsets of vertices of 
size k, we check in time O(k?) whether every pair of vertices in the subset 
is adjacent. Unfortunately, this XP algorithm is quite hopeless to run for 
n = 1000 and k = 10. Can we design an FPT algorithm for this problem? 
So far, no one has managed to find one. Could it be that this is because 
finding a k-clique is NP-hard for some fixed value of k? Suppose the problem 
was NP-hard for k = 100. We just gave an algorithm for finding a clique of 
size 100 in time O(n!°°), which is polynomial time. We would then have a 
polynomial-time algorithm for an NP-hard problem, implying that P = NP. 
So we cannot expect to be able to use NP-hardness in this way in order to 
rule out an FPT algorithm for CLIQUE. More generally, it seems very difficult 
to use NP-hardness in order to explain why a problem that does have an XP 
algorithm does not admit an FPT algorithm. 

Since NP-hardness is insufficient to differentiate between problems with 
f(k)-n9)-time algorithms and problems with f(k) -n°-time algorithms, we 
resort to stronger complexity theoretical assumptions. The theory of W[1]- 
hardness (see Chapter [13) allows us to prove (under certain complexity as- 
sumptions) that even though a problem is polynomial-time solvable for every 
fixed k, the parameter k has to appear in the exponent of n in the running 
time, that is, the problem is not FPT. This theory has been quite successful 
for identifying which parameterized problems are FPT and which are unlikely 
to be. Besides this qualitative classification of FPT versus W[1]-hard, more 
recent developments give us also (an often surprisingly tight) quantitative 
understanding of the time needed to solve a parameterized problem. Under 
reasonable assumptions about the hardness of CNF-SAT (see Chapter {14}, 
it is possible to show that there is no f(k)- n°, or even a f(k) - n°)-time 
algorithm for finding a clique on k vertices. Thus, up to constant factors 
in the exponent, the naive O(n*)-time algorithm is optimal! Over the past 
few years, it has become a rule, rather than an exception, that whenever 
we are unable to significantly improve the running time of a parameterized 
algorithm, we are able to show that the existing algorithms are asymptoti- 
cally optimal, under reasonable assumptions. For example, under the same 
assumptions that we used to rule out an f(k)-n°")-time algorithm for solv- 
ing CLIQUE, we can also rule out a 2°") . 2° )-time algorithm for the BAR 
FIGHT PREVENTION problem from the beginning of this chapter. 


Any algorithmic theory is incomplete without an accompanying com- 
plexity theory that establishes intractability of certain problems. There 
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Problem Good news Bad news 





Bar Fight PREVENTION O(2*-k-n)-time algorithm NP-hard 

(probably not in P) 
CLIQUE with A O(24 . A? . n)-time algorithm NP-hard 

(probably not in P) 


CLIQUE with k nO(*) time algorithm W[1]-hard 
(probably not FPT) 


VERTEX COLORING NP-hard for k = 3 
(probably not XP) 


Fig. 1.3: Overview of the problems in this chapter 


is such a complexity theory providing lower bounds on the running time 
required to solve parameterized problems. 


Finding cliques — with a different parameter 


OK, so there probably is no algorithm for solving CLIQUE with running time 
f(k)-n°\). But what about those scary groups of people that might come for 
you if you refuse the wrong person at the door? They do not care at all about 
the computational hardness of CLIQUE, and neither do their fists. What can 
you do? Well, in Norway most people do not have too many friends. In fact, 
it is quite unheard of that someone has more than A = 20 friends. That 
means that we are trying to find a k-clique in a graph of maximum degree 
A. This can be done quite efficiently: if we guess one vertex v in the clique, 
then the remaining vertices in the clique must be among the A neighbors of 
v. Thus we can try all of the 24 subsets of the neighbors of v, and return 
the largest clique that we found. The total running time of this algorithm 
is O(24 . A? - n), which is quite feasible for A = 20. Again it is possible to 
use complexity theoretic assumptions on the hardness of CNF-SAT to show 
that this algorithm is asymptotically optimal, up to multiplicative constants 
in the exponent. 

What the algorithm above shows is that the CLIQUE problem is FPT when 
the parameter is the maximum degree A of the input graph. At the same time 
CLIQUE is probably not FPT when the parameter is the solution size k. Thus, 
the classification of the problem into “tractable” or “intractable” crucially 
depends on the choice of parameter. This makes a lot of sense; the more we 
know about our input instances, the more we can exploit algorithmically! 
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The art of parameterization 


For typical optimization problems, one can immediately find a relevant pa- 
rameter: the size of the solution we are looking for. In some cases, however, 
it is not completely obvious what we mean by the size of the solution. For 
example, consider the variant of BAR FIGHT PREVENTION where we want 
to reject. at most k guests such that the number of conflicts is reduced to at 
most Z (as we believe that the bouncers at the bar can handle £ conflicts, 
but not more). Then we can parameterize either by k or by £. We may even 
parameterize by both: then the goal is to find an FPT algorithm with running 
time f(k, 2) - n° for some computable function f depending only on k and £. 
Thus the theory of parameterization and FPT algorithms can be extended to 
considering a set of parameters at the same time. Formally, however, one can 
express parameterization by k and £ simply by defining the value k + £ to be 
the parameter: an f(k,£) -n° algorithm exists if and only if an f(k +£) -n° 
algorithm exists. 

The parameters k and £ in the extended BAR FIGHT PREVENTION exam- 
ple of the previous paragraph are related to the objective of the problem: they 
are parameters explicitly given in the input, defining the properties of the so- 
lution we are looking for. We get more examples of this type of parameter if 
we define variants of BAR FIGHT PREVENTION where we need to reject at 
most k guests such that, say, the number of conflicts decreases by p, or such 
that each accepted guest has conflicts with at most d other accepted guests, 
or such that the average number of conflicts per guest is at most a. Then 
the parameters p, d, and a are again explicitly given in the input, telling us 
what kind of solution we need to find. The parameter A (maximum degree 
of the graph) in the CLIQUE example is a parameter of a very different type: 
it is not given explicitly in the input, but it is a measure of some property of 
the input instance. We defined and explored this particular measure because 
we believed that it is typically small in the input instances we care about: 
this parameter expresses some structural property of typical instances. We 
can identify and investigate any number of such parameters. For example, 
in problems involving graphs, we may parameterize by any structural pa- 
rameter of the graph at hand. Say, if we believe that the problem is easy on 
planar graphs and the instances are “almost planar”, then we may explore the 
parameterization by the genus of the graph (roughly speaking, a graph has 
genus g if it can be drawn without edge crossings on a sphere with g holes in 
it). A large part of Chapter |7| (and also Chapter [11) is devoted to parame- 
terization by treewidth, which is a very important parameter measuring the 
“tree-likeness” of the graph. For problems involving satisfiability of Boolean 
formulas, we can have such parameters as the number of variables, or clauses, 
or the number of clauses that need to be satisfied, or that are allowed not 
to be satisfied. For problems involving a set of strings, one can parameter- 
ize by the maximum length of the strings, by the size of the alphabet, by 
the maximum number of distinct symbols appearing in each string, etc. In 
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a problem involving a set of geometric objects (say, points in space, disks, 
or polygons), one may parameterize by the maximum number of vertices of 
each polygon or the dimension of the space where the problem is defined. For 
each problem, with a bit of creativity, one can come up with a large number 
of (combinations of) parameters worth studying. 


For the same problem there can be multiple choices of parameters. Se- 
lecting the right parameter(s) for a particular problem is an art. 


Parameterized complexity allows us to study how different parameters in- 
fluence the complexity of the problem. A successful parameterization of a 
problem needs to satisfy two properties. First, we should have some rea- 
son to believe that the selected parameter (or combination of parameters) 
is typically small on input instances in some application. Second, we need 
efficient algorithms where the combinatorial explosion is restricted to the 
parameter(s), that is, we want the problem to be FPT with this parameter- 
ization. Finding good parameterizations is an art on its own and one may 
spend quite some time on analyzing different parameterizations of the same 
problem. However, in this book we focus more on explaining algorithmic tech- 
niques via carefully chosen illustrative examples, rather than discussing every 
possible aspect of a particular problem. Therefore, even though different pa- 
rameters and parameterizations will appear throughout the book, we will not 
try to give a complete account of all known parameterizations and results for 
any concrete problem. 


1.1 Formal definitions 


We finish this chapter by leaving the realm of pub jokes and moving to more 
serious matters. Before we start explaining the techniques for designing pa- 
rameterized algorithms, we need to introduce formal foundations of param- 
eterized complexity. That is, we need to have rigorous definitions of what a 
parameterized problem is, and what it means that a parameterized problem 
belongs to a specific complexity class. 


Definition 1.1. A parameterized problem is a language L C X* x N, where 
X is a fixed, finite alphabet. For an instance (x, k) € X* x N, k is called the 
parameter. 


For example, an instance of CLIQUE parameterized by the solution size is 
a pair (G, k), where we expect G to be an undirected graph encoded as a 
string over X, and k is a positive integer. That is, a pair (G, k) belongs to the 
CLIQUE parameterized language if and only if the string G correctly encodes 
an undirected graph, which we will also denote by G, and moreover the graph 
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G contains a clique on k vertices. Similarly, an instance of the CNF-SAT 
problem (satisfiability of propositional formulas in CNF), parameterized by 
the number of variables, is a pair (y,n), where we expect y to be the input 
formula encoded as a string over X and n to be the number of variables of 
p. That is, a pair (p,n) belongs to the CNF-SAT parameterized language 
if and only if the string y correctly encodes a CNF formula with n variables, 
and the formula is satisfiable. 

We define the size of an instance (z,k) of a parameterized problem as 
|x| + k. One interpretation of this convention is that, when given to the 
algorithm on the input, the parameter k is encoded in unary. 


Definition 1.2. A parameterized problem L C X* x N is called fixed- 
parameter tractable (FPT) if there exists an algorithm A (called a fixed- 
parameter algorithm), a computable function f: N — N, and a constant 
c such that, given (x, k) € X* x N, the algorithm A correctly decides 
whether (x, k) € L in time bounded by f(k) - |(x, &)|°. The complexity 
class containing all fixed-parameter tractable problems is called FPT. 


Before we go further, let us make some remarks about the function f in 
this definition. Observe that we assume f to be computable, as otherwise we 
would quickly run into trouble when developing complexity theory for fixed- 
parameter tractability. For technical reasons, it will be convenient to assume, 
from now on, that f is also nondecreasing. Observe that this assumption 
has no influence on the definition of fixed-parameter tractability as stated in 
Definition since for every computable function f: N — N there exists a 
computable nondecreasing function f that is never smaller than f: we can 
simply take f(k) = max;=o,1,...,4 f(). Also, for standard algorithmic results 
it is always the case that the bound on the running time is a nondecreasing 
function of the complexity measure, so this assumption is indeed satisfied in 
practice. However, the assumption about f being nondecreasing is formally 
needed in various situations, for example when performing reductions. 

We now define the complexity class XP. 


Definition 1.3. A parameterized problem L C X* x N is called slice-wise 
polynomial (XP) if there exists an algorithm A and two computable functions 
f,g: N > N such that, given (x, k) € X* x N, the algorithm A correctly de- 
cides whether (x, k) € L in time bounded by f(k)-|(x, k)|9. The complexity 
class containing all slice-wise polynomial problems is called XP. 


Again, we shall assume that the functions f,g in this definition are nonde- 
creasing. 

The definition of a parameterized problem, as well as the definitions of 
the classes FPT and XP, can easily be generalized to encompass multiple 
parameters. In this setting we simply allow k to be not just one nonnegative 
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integer, but a vector of d nonnegative integers, for some fixed constant d. 
Then the functions f and g in the definitions of the complexity classes FPT 
and XP can depend on all these parameters. 

Just as “polynomial time” and “polynomial-time algorithm” usually refer 
to time polynomial in the input size, the terms “FPT time” and “FPT algo- 
rithms” refer to time f(k) times a polynomial in the input size. Here f is a 
computable function of k and the degree of the polynomial is independent of 
both n and k. The same holds for “XP time” and “XP algorithms”, except 
that here the degree of the polynomial is allowed to depend on the parameter 
k, as long as it is upper bounded by g(k) for some computable function g. 

Observe that, given some parameterized problem L, the algorithm de- 
signer has essentially two different optimization goals when designing FPT 
algorithms for L. Since the running time has to be of the form f(k) - n°, one 
can: 


e optimize the parametric dependence of the running time, i.e., try to design 
an algorithm where function f grows as slowly as possible; or 

e optimize the polynomial factor in the running time, i.e., try to design an 
algorithm where constant c is as small as possible. 


Both these goals are equally important, from both a theoretical and a practi- 
cal point of view. Unfortunately, keeping track of and optimizing both factors 
of the running time can be a very difficult task. For this reason, most research 
on parameterized algorithms concentrates on optimizing one of the factors, 
and putting more focus on each of them constitutes one of the two dominant 
trends in parameterized complexity. Sometimes, when we are not interested in 
the exact value of the polynomial factor, we use the O*-notation, which sup- 
presses factors polynomial in the input size. More precisely, a running time 
O*(f(k)) means that the running time is upper bounded by f(k) - n°, 
where n is the input size. 

The theory of parameterized complexity has been pioneered by Downey 
and Fellows over the last two decades [153]. The main 
achievement of their work is a comprehensive complexity theory for param- 
eterized problems, with appropriate notions of reduction and completeness. 
The primary goal is to understand the qualitative difference between fixed- 
parameter tractable problems, and problems that do not admit such effi- 
cient algorithms. The theory contains a rich “positive” toolkit of techniques 
for developing efficient parameterized algorithms, as well as a correspond- 
ing “negative” toolkit that supports a theory of parameterized intractability. 
This textbook is mostly devoted to a presentation of the positive toolkit: in 
Chapters [2] through [12] we present various algorithmic techniques for design- 
ing fixed-parameter tractable algorithms. As we have argued, the process of 
algorithm design has to use both toolkits in order to be able to conclude that 
certain research directions are pointless. Therefore, in Part [ITT] we give an 
introduction to lower bounds for parameterized problems. 
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Downey and Fellows laid the foundation of parameterized complexity in the series of pa- 
pers {150} [151]. The classic reference on parameterized complexity is the book of 
Downey and Fellows [153]. The new edition of this book is a comprehensive overview 
of the state of the art in many areas of parameterized complexity. The book of Flum 
and Grohe [189] is an extensive introduction to the area with a strong emphasis on the 
complexity viewpoint. An introduction to basic algorithmic techniques in parameterized 
complexity up to 2006 is given in the book of Niedermeier |376|. The recent book 
contains a collection of surveys on different areas of parameterized complexity. 


Chapter 2 
Kernelization 


Kernelization is a systematic approach to study 
polynomial-time preprocessing algorithms. It is an 
important tool in the design of parameterized algo- 
rithms. In this chapter we explain basic kernelization 
techniques such as crown decomposition, the expan- 
sion lemma, the sunflower lemma, and linear pro- 
gramming. We illustrate these techniques by obtain- 
ing kernels for VERTEX COVER, FEEDBACK ARC SET 
IN TOURNAMENTS, EDGE CLIQUE COVER, MAXIMUM 
SATISFIABILITY, and d-HITTING SET. 





Preprocessing (data reduction or kernelization) is used universally in al- 
most every practical computer implementation that aims to deal with an NP- 
hard problem. The goal of a preprocessing subroutine is to solve efficiently 
the “easy parts” of a problem instance and reduce it (shrink it) to its com- 
putationally difficult “core” structure (the problem kernel of the instance). In 
other words, the idea of this method is to reduce (but not necessarily solve) 
the given problem instance to an equivalent “smaller sized” instance in time 
polynomial in the input size. A slower exact algorithm can then be run on 
this smaller instance. 

How can we measure the effectiveness of such a preprocessing subrou- 
tine? Suppose we define a useful preprocessing algorithm as one that runs 
in polynomial time and replaces an instance J with an equivalent instance 
that is at least one bit smaller. Then the existence of such an algorithm for 
an NP-hard problem would imply P= NP, making it unlikely that such an 
algorithm can be found. For a long time, there was no other suggestion for 
a formal definition of useful preprocessing, leaving the mathematical analy- 
sis of polynomial-time preprocessing algorithms largely neglected. But in the 
language of parameterized complexity, we can formulate a definition of use- 
ful preprocessing by demanding that large instances with a small parameter 
should be shrunk, while instances that are small compared to their parameter 
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do not have to be processed further. These ideas open up the “lost continent” 
of polynomial-time algorithms called kernelization. 

In this chapter we illustrate some commonly used techniques to design 
kernelization algorithms through concrete examples. The next section, Sec- 
tion 2.1] provides formal definitions. In Section [2.2]we give kernelization algo- 
rithms based on so-called natural reduction rules. Section 2.3] introduces the 
concepts of crown decomposition and the expansion lemma, and illustrates 
it on MAXIMUM SATISFIABILITY. Section [2.5] studies tools based on linear 
programming and gives a kernel for VERTEX COVER. Finally, we study the 
sunflower lemma in Section [2.6] and use it to obtain a polynomial kernel for 
d-HITTING SET. 


2.1 Formal definitions 


We now turn to the formal definition that captures the notion of kerneliza- 
tion. A data reduction rule, or simply, reduction rule, for a parameterized 
problem Q is a function ¢: X* x N — X* x N that maps an instance (J, k) 
of Q to an equivalent instance (J’,k’) of Q such that ¢ is computable in 
time polynomial in || and k. We say that two instances of Q are equivalent 
if (I, k) € Q if and only if (J’,k’) € Q; this property of the reduction rule ¢, 
that it translates an instance to an equivalent one, is sometimes referred to 
as the safeness or soundness of the reduction rule. In this book, we stick to 
the phrases: a rule is safe and the safeness of a reduction rule. 

The general idea is to design a preprocessing algorithm that consecutively 
applies various data reduction rules in order to shrink the instance size as 
much as possible. Thus, such a preprocessing algorithm takes as input an 
instance (I,k) € X* x N of Q, works in polynomial time, and returns an 
equivalent instance (I’,k’) of Q. In order to formalize the requirement that 
the output instance has to be small, we apply the main principle of Parame- 
terized Complexity: The complexity is measured in terms of the parameter. 
Consequently, the output size of a preprocessing algorithm A is a function 
size 4: N > NU {oo} defined as follows: 


size 4(k) = sup{|I"| +k’ : (’,k’) = AU,k), I € 2*}. 


In other words, we look at all possible instances of Q with a fixed parameter k, 
and measure the supremum of the sizes of the output of A on these instances. 
Note that this supremum may be infinite; this happens when we do not have 
any bound on the size of A(I, k) in terms of the input parameter k only. 
Kernelization algorithms are exactly these preprocessing algorithms whose 
output size is finite and bounded by a computable function of the parameter. 


Definition 2.1 (Kernelization, kernel). A kernelization algorithm, or 
simply a kernel, for a parameterized problem Q is an algorithm A that, given 
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an instance (I, k) of Q, works in polynomial time and returns an equivalent 
instance (I’,k’) of Q. Moreover, we require that size 4(k) < g(k) for some 
computable function g: NN. 


The size requirement in this definition can be reformulated as follows: 
There exists a computable function g(-) such that whenever (J’,k’) is the 
output for an instance (I, k), then it holds that |I| +k’ < g(k). If the upper 
bound g(-) is a polynomial (linear) function of the parameter, then we say 
that Q admits a polynomial (linear) kernel. We often abuse the notation and 
call the output of a kernelization algorithm the “reduced” equivalent instance, 
also a kernel. 

In the course of this chapter, we will often encounter a situation when 
in some boundary cases we are able to completely resolve the considered 
problem instance, that is, correctly decide whether it is a yes-instance or a 
no-instance. Hence, for clarity, we allow the reductions (and, consequently, 
the kernelization algorithm) to return a yes/no answer instead of a reduced 
instance. Formally, to fit into the introduced definition of a kernel, in such 
cases the kernelization algorithm should instead return a constant-size trivial 
yes-instance or no-instance. Note that such instances exist for every param- 
eterized language except for the empty one and its complement, and can be 
therefore hardcoded into the kernelization algorithm. 

Recall that, given an instance (I, k) of Q, the size of the kernel is defined 
as the number of bits needed to encode the reduced equivalent instance I’ 
plus the parameter value k’. However, when dealing with problems on graphs, 
hypergraphs, or formulas, often we would like to emphasize other aspects of 
output instances. For example, for a graph problem Q, we could say that Q 
admits a kernel with O(k?) vertices and O(k°) edges to emphasize the upper 
bound on the number of vertices and edges in the output instances. Similarly, 
for a problem defined on formulas, we could say that the problem admits a 
kernel with O(k) variables. 

It is important to mention here that the early definitions of kernelization 
required that k’ < k. On an intuitive level this makes sense, as the parame- 
ter k measures the complexity of the problem — thus the larger the k, the 
harder the problem. This requirement was subsequently relaxed, notably in 
the context of lower bounds. An advantage of the more liberal notion of ker- 
nelization is that it is robust with respect to polynomial transformations of 
the kernel. However, it limits the connection with practical preprocessing. 
All the kernels mentioned in this chapter respect the fact that the output 
parameter is at most the input parameter, that is, k’ < k. 


While usually in Computer Science we measure the efficiency of an 
algorithm by estimating its running time, the central measure of the 
efficiency of a kernelization algorithm is a bound on its output size. 
Although the actual running time of a kernelization algorithm is of 
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ten very important for practical applications, in theory a kernelization 
algorithm is only required to run in polynomial time. 


If we have a kernelization algorithm for a problem for which there is some 
algorithm (with any running time) to decide whether (J, k) is a yes-instance, 
then clearly the problem is FPT, as the size of the reduced instance I is 
simply a function of k (and independent of the input size n). However, a 
surprising result is that the converse is also true. 


Lemma 2.2. If a parameterized problem Q is FPT then it admits a kernel- 
ization algorithm. 


Proof. Since Q is FPT, there is an algorithm A deciding if (I, k) € Q in time 
f(k)-|Z|° for some computable function f and a constant c. We obtain a ker- 
nelization algorithm for Q as follows. Given an input (J, k), the kernelization 
algorithm runs A on (I, k), for at most |I|°t! steps. If it terminates with an 
answer, use that answer to return either that (I, k) is a yes-instance or that 
it is a no-instance. If A does not terminate within |J|°*! steps, then return 
(I, k) itself as the output of the kernelization algorithm. Observe that since 
A did not terminate in |J|°*! steps, we have that f(k) - |I|° > |Z|¢t', and 
thus |Z| < f(k). Consequently, we have |I| +k < f(k) +k, and we obtain a 
kernel of size at most f(k) +k; note that this upper bound is computable as 
f(k) is a computable function. 

















Lemma|2.2|implies that a decidable problem admits a kernel if and only 
if it is fixed-parameter tractable. Thus, in a sense, kernelization can be 
another way of defining fixed-parameter tractability. 


However, kernels obtained by this theoretical result are usually of expo- 
nential (or even worse) size, while problem-specific data reduction rules often 
achieve quadratic (g(k) = O(k?)) or even linear-size (g(k) = O(k)) kernels. 
So a natural question for any concrete FPT problem is whether it admits 
a problem kernel that is bounded by a polynomial function of the param- 
eter (g(k) = k°). In this chapter we give polynomial kernels for several 
problems using some elementary methods. In Chapter p] we give more ad- 
vanced methods for obtaining kernels. 


2.2 Some simple kernels 


In this section we give kernelization algorithms for VERTEX COVER and 
FEEDBACK ARC SET IN TOURNAMENTS (FAST) based on a few natural 
reduction rules. 
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2.2.1 VERTEX COVER 


Let G be a graph and S C V(G). The set S is called a vertex cover if for 
every edge of G at least one of its endpoints is in S. In other words, the 
graph G — S contains no edges and thus V(G) \ S is an independent set. In 
the VERTEX COVER problem, we are given a graph G and a positive integer 
k as input, and the objective is to check whether there exists a vertex cover 
of size at most k. 

The first reduction rule is based on the following simple observation. For 
a given instance (G,k) of VERTEX COVER, if the graph G has an isolated 
vertex, then this vertex does not cover any edge and thus its removal does 
not change the solution. This shows that the following rule is safe. 


Reduction VC.1. If G contains an isolated vertex v, delete v from G. The 
new instance is (G — v, k). 


The second rule is based on the following natural observation: 


If G contains a vertex v of degree more than k, then v should be in 
every vertex cover of size at most k. 


Indeed, this is because if v is not in a vertex cover, then we need at 
least k + 1 vertices to cover edges incident to v. Thus our second rule is the 
following. 


Reduction VC.2. If there is a vertex v of degree at least k + 1, then delete 
v (and its incident edges) from G and decrement the parameter k by 1. The 
new instance is (G — v, k — 1). 


Observe that exhaustive application of reductions and completely 
removes the vertices of degree 0 and degree at least k + 1. The next step is 
the following observation. 


If a graph has maximum degree d, then a set of k vertices can cover at 
most kd edges. 


This leads us to the following lemma. 


Lemma 2.3. If (G, k) is a yes-instance and none of the reduction rules| VC. 1 
is applicable to G, then |V(G)| < k? +k and |E(G)| < k?. 


Proof. Because we cannot apply Reductions anymore on G, G has 
no isolated vertices. Thus for every vertex cover S of G, every vertex of 
G — S should be adjacent to some vertex from S. Since we cannot apply 
Reductions every vertex of G has degree at most k. It follows that 
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|V(G — S)| < k|S| and hence |V(G)| < (k + 1)|S|. Since (G, k) is a yes- 
instance, there is a vertex cover S of size at most k, so |V(G)| < (k + 1)k. 
Also every edge of G is covered by some vertex from a vertex cover and every 
vertex can cover at most k edges. Hence if G has more than k? edges, this is 
again a no-instance. 














Lemmaj2.3jallows us to claim the final reduction rule that explicitly bounds 
the size of the kernel. 


Reduction VC.3. Let (G, k) bean input instance such that Reductions[VC.1] 
and are not applicable to (G, k). If k < 0 and G has more than k* + k 
vertices, or more than k? edges, then conclude that we are dealing with a 
no-instance. 


Finally, we remark that all reduction rules are trivially applicable in linear 
time. Thus, we obtain the following theorem. 


Theorem 2.4. VERTEX COVER admits a kernel with O(k?) vertices and 
O(k?) edges. 


2.2.2 FEEDBACK ARC SET IN TOURNAMENTS 


In this section we discuss a kernel for the FEEDBACK ARC SET IN TOURNA- 
MENTS problem. A tournament is a directed graph T such that for every pair 
of vertices u,v € V(T), exactly one of (u,v) or (v, u) is a directed edge (also 
often called an arc) of T. A set of edges A of a directed graph G is called a 
feedback arc set if every directed cycle of G contains an edge from A. In other 
words, the removal of A from G turns it into a directed acyclic graph. Very 
often, acyclic tournaments are called transitive (note that then E(G) is a 
transitive relation). In the FEEDBACK ARC SET IN TOURNAMENTS problem 
we are given a tournament T and a nonnegative integer k. The objective is 
to decide whether T has a feedback arc set of size at most k. 

For tournaments, the deletion of edges results in directed graphs which 
are not tournaments anymore. Because of that, it is much more convenient 
to use the characterization of a feedback arc set in terms of “reversing edges”. 
We start with the following well-known result about topological orderings of 
directed acyclic graphs. 


Lemma 2.5. A directed graph G is acyclic if and only if it is possible to 
order its vertices in such a way such that for every directed edge (u,v), we 
have u < v. 


We leave the proof of Lemma [2.5] as an exercise; see Exercise [2.1] Given 
a directed graph G and a subset F C E(G) of edges, we define G ® F to be 
the directed graph obtained from G by reversing all the edges of F. That is, 
if rev(F) = {(u,v) : (v,u) E€ F}, then for G ® F the vertex set is V(G) 
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and the edge set E(G ® F) = (E(G) Urev(F)) \ F. Lemma [2.5] implies the 
following. 


Observation 2.6. Let G be a directed graph and let F be a subset of edges 
of G. If G@ F is a directed acyclic graph then F is a feedback arc set of G. 


The following lemma shows that, in some sense, the opposite direction 
of the statement in Observation is also true. However, the minimality 
condition in Lemma [2.7] is essential, see Exercise [2.2] 


Lemma 2.7. Let G be a directed graph and F be a subset of E(G). Then 
F is an inclusion-wise minimal feedback arc set of G if and only if F is an 
inclusion-wise minimal set of edges such that G ® F is an acyclic directed 
graph. 


Proof. We first prove the forward direction of the lemma. Let F be an 
inclusion-wise minimal feedback arc set of G. Assume to the contrary that 
G@F has a directed cycle C. Then C cannot contain only edges of E(G)\F, as 
that would contradict the fact that F is a feedback arc set. Let f1, fo,--- , fe 
be the edges of C N rev(F) in the order of their appearance on the cycle C, 
and let e; € F be the edge f; reversed. Since F is inclusion-wise minimal, 
for every e;, there exists a directed cycle C; in G such that F N C; = {e;}. 
Now consider the following closed walk W in G: we follow the cycle C, but 
whenever we are to traverse an edge f; € rev(F) (which is not present in 
G), we instead traverse the path C; — e;. By definition, W is a closed walk 
in G and, furthermore, note that W does not contain any edge of F. This 
contradicts the fact that F is a feedback arc set of G. 

The minimality follows from Observation That is, every set of edges 
F such that G ® F is acyclic is also a feedback arc set of G, and thus, if F is 
not a minimal set such that G ® F is acyclic, then it will contradict the fact 
that F is a minimal feedback arc set. 

For the other direction, let F be an inclusion-wise minimal set of edges 
such that G ® F is an acyclic directed graph. By Observation Fisa 
feedback arc set of Œ. Moreover, F is an inclusion-wise minimal feedback arc 
set, because if a proper subset F’ of F is an inclusion-wise minimal feedback 
arc set of G, then by the already proved implication of the lemma, G ® F” is 
an acyclic directed graph, a contradiction with the minimality of F. 














We are ready to give a kernel for FEEDBACK ARC SET IN TOURNAMENTS. 


Theorem 2.8. FEEDBACK ARC SET IN TOURNAMENTS admits a kernel with 
at most k? + 2k vertices. 


Proof. Lemma implies that a tournament T has a feedback arc set of 
size at most k if and only if it can be turned into an acyclic tournament by 
reversing directions of at most k edges. We will use this characterization for 
the kernel. 

In what follows by a triangle we mean a directed cycle of length three. We 
give two simple reduction rules. 
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Reduction FAST.1. If an edge e is contained in at least k + 1 triangles, 
then reverse e and reduce k by 1. 


Reduction FAST.2. If a vertex v is not contained in any triangle, then 
delete v from T. 


The rules follow similar guidelines as in the case of VERTEX COVER. In 
Reduction [FAST. 1] we greedily take into a solution an edge that partic- 
ipates in k + 1 otherwise disjoint forbidden structures (here, triangles). 
In Reduction|FAST.2] we discard vertices that do not participate in any 
forbidden structure, and should be irrelevant to the problem. 

However, a formal proof of the safeness of Reduction [FAST.2]is not 
immediate: we need to verify that deleting v and its incident edges does 
not make make a yes-instance out of a no-instance. 


Note that after applying any of the two rules, the resulting graph is again 
a tournament. The first rule is safe because if we do not reverse e, we have 
to reverse at least one edge from each of k + 1 triangles containing e. Thus e 
belongs to every feedback arc set of size at most k. 

Let us now prove the safeness of the second rule. Let X = N*(v) be the 
set of heads of directed edges with tail v and let Y = N- (v) be the set of 
tails of directed edges with head v. Because T is a tournament, X and Y isa 
partition of V(T)\{v}. Since v is not a part of any triangle in T, we have that 
there is no edge from X to Y (with head in Y and tail in X). Consequently, 
for any feedback arc set A; of tournament T|X] and any feedback arc set 
Ay of tournament T[Y], the set A, U Ag is a feedback arc set of T. As the 
reverse implication is trivial (for any feedback arc set A in T, AN E(T[X]) is 
a feedback arc set of T|X], and AN E(T[Y]) is a feedback arc set of T[Y]), 
we have that (T, k) is a yes-instance if and only if (T — v, k) is. 

Finally, we show that every reduced yes-instance T, an instance on which 
none of the presented reduction rules are applicable, has at most k(k + 2) 
vertices. Let A be a feedback arc set of a reduced instance T of size at most 
k. For every edge e € A, aside from the two endpoints of e, there are at most 
k vertices that are in triangles containing e — otherwise we would be able 
to apply Reduction [FAST.1] Since every triangle in T contains an edge of A 
and every vertex of T is in a triangle, we have that T has at most k(k + 2) 
vertices. 

Thus, given (T,k) we apply our reduction rules exhaustively and obtain 
an equivalent instance (T’,k’). If T’ has more than k’? + k’ vertices, then the 
algorithm returns that (T,k) is a no-instance, otherwise we get the desired 
kernel. This completes the proof of the theorem. 
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2.2.3 EDGE CLIQUE COVER 


Not all FPT problems admit polynomial kernels. In the EDGE CLIQUE 
COVER problem, we are given a graph G and a nonnegative integer k, and 
the goal is to decide whether the edges of G can be covered by at most 
k cliques. In this section we give an exponential kernel for EDGE CLIQUE 
Cover. In Theorem of Section we remark that this simple 
kernel is essentially optimal. 

Let us recall the reader that we use N(v) = {u : uv € E(G)} to denote 
the neighborhood of vertex v in G, and N[v] = N(v) U {v} to denote the 
closed neighborhood of v. We apply the following data reduction rules in the 
given order (i.e., we always use the lowest-numbered rule that modifies the 
instance). 


Reduction ECC.1. Remove isolated vertices. 


Reduction ECC.2. If there is an isolated edge uv (a connected component 
that is just an edge), delete it and decrease k by 1. The new instance is 


(G — {u,v}, k — 1). 


Reduction ECC.3. If there is an edge uv whose endpoints have exactly the 
same closed neighborhood, that is, N[u] = N[v], then delete v. The new 
instance is (G — v, k). 


The crux of the presented kernel for EDGE CLIQUE COVER is an obser- 
vation that two true twins (vertices u and v with N[u] = N[v]) can be 
treated in exactly the same way in some optimum solution, and hence 
we can reduce them. Meanwhile, the vertices that are contained in ex- 
actly the same set of cliques in a feasible solution have to be true twins. 
This observation bounds the size of the kernel. 


The safeness of the first two reductions is trivial, while the safeness of 
Reduction [ECC.3|follows from the observation that a solution in G — v can 
be extended to a solution in G by adding v to all the cliques containing u 


(see Exercise [2.3). 


Theorem 2.9. EDGE CLIQUE COVER admits a kernel with at most 2" ver- 
tices. 


Proof. We start with the following claim. 


Claim. If (G,k) is a reduced yes-instance, on which none of the presented 
reduction rules can be applied, then |V(G)| < 2*. 


Proof. Let C1,...,C, be an edge clique cover of G. We claim that G has at 
most 2" vertices. Targeting a contradiction, let us assume that G has more 
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than 2* vertices. We assign to each vertex v € V(G) a binary vector b, of 
length k, where bit i, 1 < i < k, is set to 1 if and only if v is contained in clique 
Ci. Since there are only 2" possible vectors, there must be u Æ v € V(G) 
with b, = by. If b, and b, are zero vectors, the first rule applies; otherwise, 
u and v are contained in the same cliques. This means that u and v are 
adjacent and have the same neighborhood; thus either Reduction [ECC.3] or 
Reduction |ECC.3| applies. Hence, if G has more than 2" vertices, at least one 
of the reduction rules can be applied to it, which is a contradiction to the 
initial assumption that G is reduced. This completes the proof of the claim. 














The kernelization algorithm works as follows. Given an instance (G,k), it 


applies Reductions JECC.1| |ECC.2] and |ECC.3] exhaustively. If the resulting 


graph has more than 2” vertices the kernelization algorithm outputs that the 
input instance is a no-instance, else it outputs the reduced instance. 














2.3 Crown decomposition 


Crown decomposition is a general kernelization technique that can be used 
to obtain kernels for many problems. The technique is based on the classical 
matching theorems of Kőnig and Hall. 

Recall that for disjoint vertex subsets U,W of a graph G, a matching M 
is called a matching of U into W if every edge of M connects a vertex of U 
and a vertex of W and, moreover, every vertex of U is an endpoint of some 
edge of M. In this situation, we also say that M saturates U. 


Definition 2.10 (Crown decomposition). A crown decomposition of a 
graph G is a partitioning of V(G) into three parts C, H and R, such that 


1. C is nonempty. 

2. C is an independent set. 

3. There are no edges between vertices of C and R. That is, H separates C 
and R. 

4. Let E’ be the set of edges between vertices of C and H. Then F’ contains 
a matching of size |H]. In other words, G contains a matching of H into 


C. 


The set C can be seen as a crown put on head H of the remaining part R, see 
Fig. Note that the fact that E’ contains a matching of size |H| implies 
that there is a matching of H into C. This is a matching in the subgraph G”, 
with the vertex set C U H and the edge set F’, saturating all the vertices of 
H. 

For finding a crown decomposition in polynomial time, we use the following 
well known structural and algorithmic results. The first is a mini-max theorem 
due to Kőnig. 
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Fig. 2.1: Example of a crown decomposition. Set C is an independent set, H 
separates C and R, and there is a matching of H into C 


Theorem 2.11 (K6nig’s theorem, [303]). In every undirected bipartite 
graph the size of a maximum matching is equal to the size of a minimum 
vertex cover. 


Let us recall that a matching saturates a set of vertices S when every 
vertex in S is incident to an edge in the matching. The second classic result 
states that in bipartite graphs, a trivial necessary condition for the existence 
of a matching is also sufficient. 


Theorem 2.12 (Hall’s theorem, ). Let G be an undirected bipartite 
graph with bipartition (Vi, V2). The graph G has a matching saturating V, if 
and only if for all X C Vi, we have |N(X)| > |X|. 


The following theorem is due to Hopcroft and Karp [268]. The proof of 
the (nonstandard) second claim of the theorem is deferred to Exercise 2.21] 


Theorem 2.13 (Hopcroft-Karp algorithm, [268]). Let G be an undi- 
rected bipartite graph with bipartition Vı and V2, on n vertices and m edges. 
Then we can find a maximum matching as well as a minimum vertex cover 
of G in time O(m/n). Furthermore, in time O(m/n) either we can find a 
matching saturating Vy or an inclusion-wise minimal set X C Vi such that 
IN(X)| < |X]. 


The following lemma is the basis for kernelization algorithms using crown 
decomposition. 


Lemma 2.14 (Crown lemma). Let G be a graph without isolated vertices 
and with at least 3k + 1 vertices. There is a polynomial-time algorithm that 
either 


e finds a matching of size k+ 1 in G; or 
e finds a crown decomposition of G. 


Proof. We first find an inclusion-maximal matching M in G. This can be 
done by a greedy algorithm. If the size of M is k + 1, then we are done. 
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Hence, we assume that |M| < k, and let Vm be the endpoints of M. We have 
|Vm| < 2k. Because M is a maximal matching, the remaining set of vertices 
I = V (G) \ Vm is an independent set. 

Consider the bipartite graph G7,y,, formed by edges of G between Vm 
and J. We compute a minimum-sized vertex cover X and a maximum sized 
matching M’ of the bipartite graph Gy,y,, in polynomial time using Theo- 
rem We can assume that |M'| < k, for otherwise we are done. Since 
|X| = |M’| by K6nig’s theorem (Theorem |2.11), we infer that |X| < k. 

If no vertex of X is in Vm, then X C J. We claim that X = I. For 
a contradiction assume that there is a vertex w € I \ X. Because G has no 
isolated vertices there is an edge, say wz, incident to w in GI, vy- Since GI Vm 
is bipartite, we have that z € Vm. However, X is a vertex cover of GI Vm 
such that X N Vm = 0, which implies that w € X. This is contrary to our 
assumption that w ¢ X, thus proving that X = I. But then |I| < |X| < k, 
and G has at most 

M| + |Vu| <k+2k = 3k 


vertices, which is a contradiction. 

Hence, XOVm 4 0. We obtain a crown decomposition (C, H, R) as follows. 
Since |X| = |M’|, every edge of the matching M’ has exactly one endpoint 
in X. Let M* denote the subset of M’ such that every edge from M* has 
exactly one endpoint in X N Vm and let Vm» denote the set of endpoints of 
edges in M*. We define head H = X N Vm = X N Vm=, crown C = Vee OT, 
and the remaining part R = V (G) \ (CU H) = V(G) \ Vm=. In other words, 
H is the set of endpoints of edges of M* that are present in Vm and C is 
the set of endpoints of edges of M* that are present in J. Obviously, C is 
an independent set and by construction, M* is a matching of H into C. 
Furthermore, since X is a vertex cover of G;y,,, every vertex of C can be 
adjacent only to vertices of H and thus H separates C and R. This completes 
the proof. 














The crown lemma gives a relatively strong structural property of graphs 
with a small vertex cover (equivalently, a small maximum matching). If 
in a studied problem the parameter upper bounds the size of a vertex 
cover (maximum matching), then there is a big chance that the struc- 
tural insight given by the crown lemma would help in developing a small 
kernel — quite often with number of vertices bounded linearly in the 
parameter. 


We demonstrate the application of crown decompositions on kernelization 
for VERTEX COVER and MAXIMUM SATISFIABILITY. 
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2.3.1 VERTEX COVER 


Consider a VERTEX COVER instance (G,k). By an exhaustive application of 
Reduction [VC.1] we may assume that G has no isolated vertices. If |V(G)| > 
3k, we may apply the crown lemma to the graph G and integer k, obtaining 
either a matching of size k +1, or a crown decomposition V (G) = CUHUR. 
In the first case, the algorithm concludes that (G, k) is a no-instance. 

In the latter case, let M be a matching of H into C. Observe that the 
matching M witnesses that, for every vertex cover X of the graph G, X 
contains at least || = |H| vertices of H UC to cover the edges of M. On 
the other hand, the set H covers all edges of G that are incident to H UC. 
Consequently, there exists a minimum vertex cover of G that contains H, and 
we may reduce (G, k) to (G—H, k—|H]|). Note that in the instance (G— H, k— 
|H|), the vertices of C are isolated and will be reduced by Reduction [VC.1] 

As the crown lemma promises us that H 4 0, we can always reduce the 
graph as long as |V(G)| > 3k. Thus, we obtain the following. 


Theorem 2.15. VERTEX COVER admits a kernel with at most 3k vertices. 


2.3.2 MAXIMUM SATISFIABILITY 


For a second application of the crown decomposition, we look at the following 
parameterized version of MAXIMUM SATISFIABILITY. Given a CNF formula 
F, and a nonnegative integer k, decide whether F has a truth assignment 
satisfying at least k clauses. 


Theorem 2.16. MAXIMUM SATISFIABILITY admits a kernel with at most k 
variables and 2k clauses. 


Proof. Let p be a CNF formula with n variables and m clauses. Let w be an 
arbitrary assignment to the variables and let ~y be the assignment obtained 
by complementing the assignment of 7. That is, if Y assigns 6 € {T, L} to 
some variable x then =y assigns 76 to x. Observe that either Y or ~y satisfies 
at least m/2 clauses, since every clause is satisfied by y% or —w (or by both). 
This means that, if m > 2k, then (y, k) is a yes-instance. In what follows we 
give a kernel with n < k variables. 

Let Go be the variable-clause incidence graph of y. That is, Gy is a bi- 
partite graph with bipartition (X,Y), where X is the set of the variables of 
y and Y is the set of clauses of y. In Go there is an edge between a variable 
x € X and a clause c € Y if and only if either x, or its negation, is in c. If 
there is a matching of X into Y in Gy, then there is a truth assignment sat- 
isfying at least |X| clauses: we can set each variable in X in such a way that 
the clause matched to it becomes satisfied. Thus at least |X| clauses are sat- 
isfied. Hence, in this case, if k < |X|, then (y, k) is a yes-instance. Otherwise, 


30 2 Kernelization 


k > |X| =n, and we get the desired kernel. We now show that, if y has at 
least n > k variables, then we can, in polynomial time, either reduce y to an 
equivalent smaller instance, or find an assignment to the variables satisfying 
at least k clauses (and conclude that we are dealing with a yes-instance). 

Suppose vy has at least k variables. Using Hall’s theorem and a polynomial- 
time algorithm computing a maximum-size matching (Theorems [2.12|and[2.13}, 
we can in polynomial time find either a matching of X into Y or an inclusion- 
wise minimal set C C X such that |N(C)| < |C|. As discussed in the previous 
paragraph, if we found a matching, then the instance is a yes-instance and 
we are done. So suppose we found a set C as described. Let H be N(C) and 
R=V(G,)\(CUH). Clearly, N(C’) C H, there are no edges between vertices 
of C and R and G[C] is an independent set. Select an arbitrary x € C. We 
have that there is a matching of C \ {x} into H since |N(C’)| > |C’| for every 
C’ C C\ {x}. Since |C| > |H|, we have that the matching from C \ {x} to H 
is in fact a matching of H into C. Hence (C, H, R) is a crown decomposition 
of Gy. 

We prove that all clauses in H are satisfied in every assignment satisfying 
the maximum number of clauses. Indeed, consider any assignment w that does 
not satisfy all clauses in H. Fix any variable x € C. For every variable y in 
C\ {x} set the value of y so that the clause in H matched to y is satisfied. Let 
y’ be the new assignment obtained from 7 in this manner. Since N(C) C H 
and yw’ satisfies all clauses in H, more clauses are satisfied by 7)’ than by w. 
Hence wv cannot be an assignment satisfying the maximum number of clauses. 

The argument above shows that (y, k) is a yes-instance to MAXIMUM SAT- 
ISFIABILITY if and only if (y \ H, k — |H]|) is. This gives rise to the following 
simple reduction. 


Reduction MSat.1. Let (y, k) and H be as above. Then remove H from y 
and decrease k by |H|. That is, (p \ H, k — |H]|) is the new instance. 


Repeated applications of Reduction and the arguments described 
above give the desired kernel. This completes the proof of the theorem. 














2.4 Expansion lemma 


In the previous subsection, we described crown decomposition techniques 
based on the classical Hall’s theorem. In this section, we introduce a powerful 
variation of Hall’s theorem, which is called the expansion lemma. This lemma 
captures a certain property of neighborhood sets in graphs and can be used 
to obtain polynomial kernels for many graph problems. We apply this result 
to get an O(k?) kernel for FEEDBACK VERTEX SET in Chapter [9] 

A q-star, q > 1, is a graph with q+1 vertices, one vertex of degree q, called 
the center, and all other vertices of degree 1 adjacent to the center. Let G be 
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a bipartite graph with vertex bipartition (A, B). For a positive integer q, a 
set of edges M C E(G) is called by a q-expansion of A into B if 


e every vertex of A is incident to exactly q edges of M; 
e M saturates exactly q|A| vertices in B. 


Let us emphasize that a qg-expansion saturates all vertices of A. Also, for 
every u,v € A, u Æ v, the set of vertices E, adjacent to u by edges of M does 
not intersect the set of vertices E, adjacent to v via edges of M, see Fig. 
Thus every vertex v € A could be thought of as the center of a star with 
its q leaves in B, with all these |A| stars being vertex-disjoint. Furthermore, 
a collection of these stars is also a family of q edge-disjoint matchings, each 
saturating A. 


A 


Fig. 2.2: Set A has a 2-expansion into B 


Let us recall that, by Hall’s theorem (Theorem|2.12), a bipartite graph with 
bipartition (A, B) has a matching of A into B if and only if |N(X)| > |X| 
for all X C A. The following lemma is an extension of this result. 


Lemma 2.17. Let G be a bipartite graph with bipartition (A, B). Then there 
is a q-expansion from A into B if and only if |N(X)| > q|X| for every X C A. 
Furthermore, if there is no q-expansion from A into B, then a set XCA 
with |N(X)| < q|X| can be found in polynomial time. 


Proof. If A has a g-expansion into B, then trivially |N(X)| > q|X| for every 
XCA. 

For the opposite direction, we construct a new bipartite graph G’ with 
bipartition (A’, B) from G by adding (q — 1) copies of all the vertices in A. 
For every vertex v € A all copies of v have the same neighborhood in B as v. 
We would like to prove that there is a matching M from A’ into B in G”. If we 
prove this, then by identifying the endpoints of M corresponding to the copies 
of vertices from A, we obtain a q-expansion in G. It suffices to check that 
the assumptions of Hall’s theorem are satisfied in G”. Assume otherwise, that 
there is a set X C A’ such that |Na (X)| < |X|. Without loss of generality, 
we can assume that if X contains some copy of a vertex v, then it contains 
all the copies of v, since including all the remaining copies increases |X| but 
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does not change |Na (X)|. Hence, the set X in A’ naturally corresponds to 
the set X4 of size |X|/q in A, the set of vertices whose copies are in X. But 
then |N@(X4)| = |Ne(X)| < |X| = g|Xa], which is a contradiction. Hence 
A’ has a matching into B and thus A has a q-expansion into B. 

For the algorithmic claim, note that, if there is no qexpansion from A 
into B, then we can use Theorem to find a set X C A’ such that 
|Na (X)| < |X|, and the corresponding set X4 satisfies |Ng(Xa)| < q|Xal. 














Finally, we are ready to prove a lemma analogous to Lemma 


Lemma 2.18. (Expansion lemma) Let q > 1 be a positive integer and G 
be a bipartite graph with vertex bipartition (A, B) such that 


(i) |B| > q| A|, and 


(ii) there are no isolated vertices in B. 
Then there exist nonempty vertex sets X C A and Y C B such that 


e there is a q-expansion of X into Y, and 
e no vertex in Y has a neighbor outside X, that is, N(Y) C X. 


Furthermore, the sets X and Y can be found in time polynomial in the size 


of G. 


Note that the sets X, Y and V (G) \ (X UY) form a crown decomposition 
of G with a stronger property — every vertex of X is not only matched into 
Y, but there is a q-expansion of X into Y. We proceed with the proof of 
expansion lemma. 


Proof. We proceed recursively, at every step decreasing the cardinality of A. 
When |A| = 1, the claim holds trivially by taking X = A and Y = B. 

We apply Lemma [2.17] to G. If A has a q-expansion into B, then we 
are done as we may again take X = A and Y = B. Otherwise, we can in 
polynomial time find a (nonempty) set Z C A such that |N(Z)| < q|Z|. We 
construct the graph G’ by removing Z and N(Z) from G. We claim that G” 
satisfies the assumptions of the lemma. Indeed, because we removed less than 
q times more vertices from B than from A, we have that (i) holds for G’. 
Moreover, every vertex from B \ N(Z) has no neighbor in Z, and thus (ii) 
also holds for G’. Note that Z # A, because otherwise N(A) = B (there are 
no isolated vertices in B) and |B| > q|A|. Hence, we recurse on the graph G” 
with bipartition (A \ Z, B \ N(Z)), obtaining nonempty sets X C A\ Z and 
Y C B\ N(Z) such that there is a q-expansion of X into Y and such that 
Na (Y) C X. Because Y C B\ N(Z), we have that no vertex in Y has a 
neighbor in Z. Hence, Na (Y) = Na(Y) C X and the pair (X,Y) satisfies 
all the required properties. 
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The expansion lemma is useful when the matching saturating the head 
part H in the crown lemma turns out to be not sufficient for a reduction, 
and we would like to have a few vertices of the crown C matched to 
a single vertex of the head H. For example, this is the case for the 
FEEDBACK VERTEX SET kernel presented in Section|9.1] where we need 
the case q = 2. 


2.5 Kernels based on linear programming 


In this section we design a 2k-vertex kernel for VERTEX COVER exploiting 
the solution to a linear programming formulation of VERTEX COVER. 

Many combinatorial problems can be expressed in the language of INTE- 
GER LINEAR PROGRAMMING (ILP). In an INTEGER LINEAR PROGRAMMING 
instance, we are given a set of integer-valued variables, a set of linear inequal- 
ities (called constraints) and a linear cost function. The goal is to find an (in- 
teger) evaluation of the variables that satisfies all constraints, and minimizes 
or maximizes the value of the cost function. 

Let us give an example on how to encode a VERTEX COVER instance (G, k) 
as an INTEGER LINEAR PROGRAMMING instance. We introduce n = |V(G)| 
variables, one variable x, for each vertex v € V(G). Setting variable x, to 1 
means that v is in the vertex cover, while setting x, to 0 means that v is not 
in the vertex cover. To ensure that every edge is covered, we can introduce 
constraints zu +£, > 1 for every edge uv € E(G). The size of the vertex cover 
is given by vev(G) Ly. In the end, we obtain the following ILP formulation: 


minimize >) ,<y(q) Xv 

subject to £u + £, > 1 for every uv € E(G), 
O0<a,<1 for every v € V(G), 
Ly EZ for every v € V(G). 


(2.1) 


Clearly, the optimal value of is at most k if and only if G has a vertex 
cover of size at most k. 

As we have just seen, INTEGER LINEAR PROGRAMMING is at least as hard 
as VERTEX COVER, so we do not expect it to be polynomial-time solvable. In 
fact, it is relatively easy to express many NP-hard problems in the language of 
INTEGER LINEAR PROGRAMMING. In Section [6.2| we discuss FPT algorithms 
for INTEGER LINEAR PROGRAMMING and their application in proving fixed- 
parameter tractability of other problems. 

Here, we proceed in a different way: we relax the integrality requirement 
of INTEGER LINEAR PROGRAMMING, which is the main source of the hard- 
ness of this problem, to obtain LINEAR PROGRAMMING. That is, in LINEAR 
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PROGRAMMING the instance looks exactly the same as in INTEGER LINEAR 
PROGRAMMING, but the variables are allowed to take arbitrary real values, 
instead of just integers. 

In the case of VERTEX COVER, we relax by dropping the constraint 
Ly E Z for every v € V(G). In other words, we obtain the following LINEAR 
PROGRAMMING instance. For a graph G, we call this relaxation LP VC(G). 


minimize vev(G) Ly 
subject to £u + £y > 1 for every uv € E(G), (2.2) 
O0<a, <1 for every v € V(G). 


Note that constraints x, < 1 can be omitted because every optimal solu- 
tion of LPVC(G) satisfies these constraints. 

Observe that in LPVC(G), a variable x, can take fractional values in the 
interval [0, 1], which corresponds to taking “part of the vertex v” into a vertex 
cover. Consider an example of G being a triangle. A minimum vertex cover 
of a triangle is of size 2, whereas in LPVC(G) we can take x, = 4 for every 
v € V(G), obtaining a feasible solution of cost 3. Thus, LPVC(G) does not 
express exactly the VERTEX COVER problem on graph G, but its optimum 
solution can still be useful to learn something about minimum vertex covers 
in G. 

The main source of utility of LINEAR PROGRAMMING comes from the 
fact that LINEAR PROGRAMMING can be solved in polynomial time, even in 
some general cases where there are exponentially many constraints, accessed 
through an oracle. For this reason, LINEAR PROGRAMMING has found abun- 
dant applications in approximation algorithms (for more on this topic, we 
refer to the book of Vazirani [427]). In this section, we use LP to design a 
small kernel for VERTEX COVER. In Section we will use LPVC(G) to 
obtain an FPT branching algorithm for VERTEX COVER. 


Let us now have a closer look at the relaxation LPVC(G). Fix an optimal 
solution (%,),»ev(g) of LP VC(G). In this solution the variables corresponding 
to vertices of G take values in the interval [0, 1]. We partition V(G) according 
to these values into three sets as follows. 

e Vo={veEV(G) : t<i}, 

e Vi={veV(G) : t= eh 

e Yi={veEV(G) : 2, > $}. 

Theorem 2.19 (Nemhauser-Trotter theorem). There is a minimum 
vertex cover S of G such that 


YUcCSCWUUN. 
Proof. Let S* C V(G) be a minimum vertex cover of G. Define 


S = (S*\ Vœ) U Vi. 
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By the constraints of (2.2), every vertex of Vo can have a neighbor only in 
V, and thus S is also a vertex cover of G. Moreover, V C S C Vy U Vi. It 
suffices to show that S is a minimum vertex cover. Assume the contrary, i.e., 
|S| > |S*|. Since |S] = |S*| — |Vo N S*| + |Vi \ S*| we infer that 


[Vo S*| < [Vi \ 5". (2.3) 


Let us define 
e=min{|z,— 4| : v E€ VUV} 

We decrease the fractional values of vertices from V; \ S* by £ and increase 
the values of vertices from Vo N S* by e. In other words, we define a vector 
(Yv )veVv(G) as 

Ly—e fveVi\ SS, 

Y= tte ifueVYns*, 
Ly otherwise. 


Note that € > 0, because otherwise Vo = V; = 0, a contradiction with (2.3). 
This, together with (2.3), implies that 


Yo oms X ee (2.4) 


vEV (G) vEV (G) 


Now we show that (y,),ev(q) is a feasible solution, i.e., it satisfies the con- 
straints of LPVC(G). Since (£4)vev(c) is a feasible solution, by the defi- 
nition of € we get 0 < y, < 1 for every v € V(G). Consider an arbitrary 
edge uv € E(G). If none of the endpoints of uv belong to Vi \ S*, then both 
Yu > Zu and Yy > Ly, SO Yu + Yv > Lu + £y => 1. Otherwise, by symmetry we 
can assume that u € Vi \ 9*, and hence yu = £u — €. Because $* is a vertex 
cover, we have that v € S*. If v € Vo N S*, then 


Yu + Yv = Lu — E + Ly + E = Tu + Ly È 1. 


Otherwise, v € (Vi U V1) A S*. Then y, > ty > 4. Note also that zu —€ > 4 
by the definition of e. It follows that 
oe ee ae 
Yu T Yv = Tu — E Woo goo = 
Thus (Yv)vev(G) is a feasible solution of LPVC(G) and hence (2.4) contra- 
dicts the optimality of (ay) yev(a)- 

Theorem allows us to use the following reduction rule. 
Reduction VC.4. Let (£v)vev(a) be an optimum solution to LPVC(G) in 
a VERTEX COVER instance (G, k) and let Vo, Vi and Vi be defined as above. 
If vev(a) Zy > k, then conclude that we are dealing with a no-instance. 


Otherwise, greedily take into the vertex cover the vertices of Vi. That is, 
delete all vertices of Vo U Vi, and decrease k by |Vj|. 
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Let us now formally verify the safeness of Reduction 
Lemma 2.20. Reduction is safe. 


Proof. Clearly, if (G,k) is a yes-instance, then an optimum solution to 
LPVC(G) is of cost at most k. This proves the correctness of the step if 
we conclude that (G, k) is a no-instance. 

Let G’ = G- (VoU Vi) = G[V1] and k’ = k- | Vi]. We claim that (G, k) is a 
yes-instance of VERTEX COVER if and only if (G’, k’) is. By Theorem [2.19] we 
know that G has a vertex cover S of size at most k such that V C S C VYiUVi2. 
Then S’ = SN Vi is a vertex cover in G” and the size of S’ is at most 
k- |V| =k’. 

For the opposite direction, let S” be a vertex cover in G”. For every solution 
of LPVC(G), every edge with an endpoint from Vo should have an endpoint 
in Vi. Hence, S = S’ U Vi is a vertex cover in G and the size of this vertex 
cover is at most k’ + |Vi| = k. 














Reduction leads to the following kernel for VERTEX COVER. 
Theorem 2.21. VERTEX COVER admits a kernel with at most 2k vertices. 


Proof. Let (G,k) be an instance of VERTEX COVER. We solve LPVC(G) 
in polynomial time, and apply Reduction to the obtained solution 
(Tv) vev(a), either concluding that we are dealing with a no-instance or ob- 
taining an instance (G’,k’). Lemma [2.20] guarantees the safeness of the re- 
duction. For the size bound, observe that 


Vie l= al= >) 252 XD ays 2k. 


vEV, vEV (G) 














While it is possible to solve linear programs in polynomial time, usually 
such solutions are less efficient than combinatorial algorithms. The specific 
structure of the LP-relaxation of the vertex cover problem allows us to 
solve it by reducing to the problem of finding a maximum-size matching in a 
bipartite graph. 


Lemma 2.22. For a graph G with n vertices and m edges, the optimal 
(fractional) solution to the linear program LPVC(G) can be found in time 


O(myn). 


Proof. We reduce the problem of solving LPVC(G) to a problem of finding 
a minimum-size vertex cover in the following bipartite graph H. Its vertex 
set consists of two copies Vı and V2 of the vertex set of G. Thus, every 
vertex v € V(G) has two copies vı € Vi; and v2 € Vz in H. For every edge 
uv € E(H), we have edges uiv and vius in H. 
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Using the Hopcroft-Karp algorithm (Theorem B-13), we can find a mini- 
mum vertex cover S of H in time O(m,/n). We define a vector (2y)yev(a) 
as follows: if both vertices vı and v2 are in S, then x, = 1. If exactly one of 
the vertices vı and v2 is in S, we put £y = 5. We put x, = 0 if none of the 
vertices vı and v2 are in S. Thus 


Since S is a vertex cover in H, we have that for every edge uv € E(G) at 
least two vertices from {u1, U2, V1, V2} should be in S. Thus 2, +2, > 1 and 
vector (%y),ev(q) satisfies the constraints of LPVC(G). 

To show that (2y)y,evig) is an optimal solution of LPVC(G), we argue 
as follows. Let (yv)vev(a) be an optimal solution of LPVC(G). For every 
vertex v;i, i € {1,2}, of H, we assign the weight w(v;) = yy. This weight 
assignment is a fractional vertex cover of H, i.e., for every edge vuz € E(H), 
w(v1) + w(u2) > 1. We have that 


E w=5 D (win) + wl). 


vEV (G) vEV (G) 


On the other hand, the value >) ,cy(z) W(v) of any fractional solution of 
LPVC(#) is at least the size of a maximum matching M in H. A reader 
familiar with linear programming can see that this follows from weak duality; 
we also ask you to verify this fact in Exercise 

By Kőnig’s theorem (Theorem [2.11), M| = |S|. Hence 


F te 5 y (w(v1) + w(v2)) = 5 5 wo)> Èl = 5 Ly. 


vEV (G) vEV (G) vEV(H) vEV(G) 











Thus (x£v)vev(a) is an optimal solution of LPVC(G). 











We immediately obtain the following. 


Corollary 2.23. For a graph G with n vertices and m edges, the kernel of 
Theorem can be found in time O(myn). 


The following proposition is another interesting consequence of the proof 
of Lemma [2.22 


Proposition 2.24. Let G be a graph on n vertices and m edges. Then 
LPVC(G) has a half-integral optimal solution, i.e., all variables have values 
in the set {0, E, 1}. Furthermore, we can find a half-integral optimal solution 


in time O(myn). 


38 2 Kernelization 


In short, we have proved properties of LPVC(G). There exists a half- 
integral optimal solution (7,),<ev(q) to LPVC(G), and it can be found 
efficiently. We can look at this solution as a partition of V(G) into 
parts Vo, Vi, and V, with the following message: greedily take V into a 
solution, do nae take any vertex of Vo into a solution, and in V1, we do 
not know what to do and that i is the hard part of the problem. However 
as an optimum solution pays 5 1 for every vertex of V1, the hard part — 
the kernel of the problem — ne have more than 2k vertices. 


2.6 Sunflower lemma 


In this section we introduce a classical result of Erdős and Rado and show 
some of its applications in kernelization. In the literature it is known as the 
sunflower lemma or as the Erdős-Rado lemma. We first define the terminology 
used in the statement of the lemma. A sunflower with k petals and a core Y 
is a collection of sets S1,..., Sp such that S; N Sj = Y for all i 4 j; the sets 
S;\Y are petals and we require none of them to be empty. Note that a family 
of pairwise disjoint sets is a sunflower (with an empty core). 


Theorem 2.25 (Sunflower lemma). Let A be a family of sets (without 
duplicates) over a universe U, such that each set in A has cardinality exactly 
d. If |A| > d\(k —1)4, then A contains a sunflower with k petals and such a 
sunflower can be computed in time polynomial in |A|, |U|, and k. 


Proof. We prove the theorem by induction on d. For d = 1, i.e., for a family 
of singletons, the statement trivially holds. Let d > 2 and let A be a family 
of sets of cardinality at most d over a universe U such that |A| > d!(k —1)¢. 

Let G = {S1,...,S¢} C A be an inclusion-wise maximal family of pairwise 
disjoint sets in A. If € > k then G is a sunflower with at least k petals. Thus 
we assume that  < k. Let S = D S;. Then |S| < d(k — 1). Because G is 
maximal, every set A € A intersects at least one set from G, i.e., AN S FO. 
Therefore, there is an element u € U contained in at least 


Al dk- 
[S| ~ d(k—1 





= (d—1)\(k—1)* 


sets from A. We take all sets of A containing such an element u, and construct 
a family A’ of sets of cardinality d—1 by removing from each set the element 
u. Because |.A’| > (d—1)!(k—1)4~1, by the induction hypothesis, A’ contains 
a sunflower {5{,...,5;,} with k petals. Then {Sj U {u},...,5, U {u}} is a 
sunflower in A with k petals. 

The proof can be easily transformed into a polynomial-time algorithm, as 
follows. Greedily select a maximal set of pairwise disjoint sets. If the size 


2.6 Sunflower lemma 39 


of this set is at least k, then return this set. Otherwise, find an element 
u contained in the maximum number of sets in A, and call the algorithm 
recursively on sets of cardinality d — 1, obtained from deleting u from the 
sets containing u. 














2.6.1 d-HITTING SET 


As an application of the sunflower lemma, we give a kernel for d-HITTING 
SET. In this problem, we are given a family A of sets over a universe U, where 
each set in the family has cardinality at most d, and a positive integer k. The 
objective is to decide whether there is a subset H C U of size at most k such 
that H contains at least one element from each set in A. 


Theorem 2.26. d-HITTING SET admits a kernel with at most d!k@ sets and 
at most d!kt - dœ elements. 


Proof. The crucial observation is that if A contains a sunflower 
S = {S4, . ., Sk+1} 


of cardinality k + 1, then every hitting set H of A of cardinality at most k 
intersects the core Y of the sunflower S. Indeed, if H does not intersect Y, 
it should intersect each of the k + 1 disjoint petals S; \ Y. This leads to the 
following reduction rule. 


Reduction HS.1. Let (U, A, k) be an instance of d-HITTING SET and as- 
sume that A contains a sunflower S = {S1,..., Sk+1} of cardinality k + 1 
with core Y. Then return (U’, A’, k), where A’ = (A \ S) U {Y} is obtained 
from A by deleting all sets {5),...,S,41} and by adding a new set Y and 
U' = Uxes X. 


Note that when deleting sets we do not delete the elements contained in 
these sets but only those which do not belong to any set. Then the instances 
(U, A, k) and (U’,A’,k) are equivalent, i.e. (U, A) contains a hitting set of 
size k if and only if (U, A’) does. 

The kernelization algorithm is as follows. If for some d’ € {1,...,d} the 
number of sets in A of size exactly d’ is more than d'!k”, then the kerneliza- 
tion algorithm applies the sunflower lemma to find a sunflower of size k + 1, 
and applies Reduction [HS.1] on this sunflower. It applies this procedure ex- 
haustively, and obtains a new family of sets A’ of size at most d!k? - d. If 
Ø € A’ (that is, at some point a sunflower with an empty core has been 
discovered), then the algorithm concludes that there is no hitting set of size 
at most k and returns that the given instance is a no-instance. Otherwise, 
every set contains at most d elements, and thus the number of elements in 
the kernel is at most d!k@ - d?. 
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Exercises 


2.1 (Z). Prove Lemma [2.5] A digraph is acyclic if and only if it is possible to order its 
vertices in such a way such that for every arc (u,v), we have u < v. 


2.2 (@&). Give an example of a feedback arc set F in a tournament G, such that G ® F is 
not acyclic. 


2.3 (2). Show that Reductions and [ECC.3|are safe. 


2.4 (Z). In the Point Line COVER problem, we are given a set of n points in the plane 
and an integer k, and the goal is to check if there exists a set of k lines on the plane that 
contain all the input points. Show a kernel for this problem with O(k?) points. 


2.5. A graph G is a cluster graph if every connected component of G is a clique. In the 
CLUSTER EDITING problem, we are given as input a graph G and an integer k, and the 
objective is to check whether one can edit (add or delete) at most k edges in G to obtain a 
cluster graph. That is, we look for a set F C (V9) of size at most k, such that the graph 
(V(G), (E(G) \ F) U (F \ E(G))) is a cluster graph. 


1. Show that a graph G is a cluster graph if and only if it does not have an induced path 
on three vertices (sequence of three vertices u,v, w such that uv and vw are edges and 
uw ¢ E(G)). 


2. Show a kernel for CLUSTER EDITING with O(k?) vertices. 


2.6. In the SET SPLITTING problem, we are given a family of sets F over a universe U and 
a positive integer k, and the goal is to test whether there exists a coloring of U with two 
colors such that at least k sets in F are nonmonochromatic (that is, they contain vertices 
of both colors). Show that the problem admits a kernel with at most 2k sets and O(k?) 
universe size. 


2.7. In the Minimum MAXIMAL MATCHING problem, we are given an undirected graph G 
and a positive integer k, and the objective is to decide whether there exists a maximal 
matching in G on at most k edges. Obtain a polynomial kernel for the problem (parame- 
terized by k). 


2.8. In the Min-OneEs-2-SAT problem, we are given a 2-CNF formula ¢ and an integer 
k, and the objective is to decide whether there exists a satisfying assignment for ¢ with at 
most k variables set to true. Show that Min-OneEs-2-SAT admits a polynomial kernel. 


2.9. In the d- BoUNDED-DEGREE DELETION problem, we are given an undirected graph G 
and a positive integer k, and the task is to find at most k vertices whose removal decreases 
the maximum vertex degree of the graph to at most d. Obtain a kernel of size polynomial 
in k and d for the problem. (Observe that VERTEX CovER is the case of d = 0.) 


2.10. Show a kernel with O(k?) vertices for the following problem: given a graph G and 
an integer k, check if G contains a subgraph with exactly k edges, whose vertices are all 
of odd degree in the subgraph. 


2.11. A set of vertices D in an undirected graph G is called a dominating set if N[D] = 
V(G). In the DOMINATING SET problem, we are given an undirected graph G and a positive 
integer k, and the objective is to test whether there exists a dominating set of size at most 
k. Show that DominaTING SET admits a polynomial kernel on graphs where the length of 
the shortest cycle is at least 5. (What would you do with vertices with degree more than 
k? Note that unlike for the VERTEX COVER. problem, you cannot delete a vertex once you 
pick it in the solution.) 
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2.12. Show that FEEDBACK VERTEX SET admits a kernel with O(k) vertices on undirected 
regular graphs. 


2.13. We say that an n-vertex digraph is well-spread if every vertex has indegree at least 
Jn. Show that DIRECTED FEEDBACK Arc SET, restricted to well-spread digraphs, is FPT 
by obtaining a polynomial kernel for this problem. Does the problem remain FPT if we 
replace the lower bound on indegree by any monotonically increasing function of n (like 
logn or log log log n)? Does the assertion hold if we replace indegree with outdegree? What 
about DIRECTED FEEDBACK VERTEX SET? 


2.14. In the CONNECTED VERTEX COVER problem, we are given an undirected graph G 
and a positive integer k, and the objective is to decide whether there exists a vertex cover 
C of G such that |C| < k and G[C] is connected. 


1. Explain where the kernelization procedure described in Theorem for VERTEX 
Cover breaks down for the CONNECTED VERTEX COVER problem. 

2. Show that the problem admits a kernel with at most 2} + O(k?) vertices. 

3. Show that if the input graph G does not contain a cycle of length 4 as a subgraph, 
then the problem admits a kernel with at most O(k?) vertices. 


2.15 ($). Extend the argument of the previous exercise to show that, for every fixed 
d > 2, CONNECTED VERTEX COVER admits a kernel of size O(k7) if restricted to graphs 
that do not contain the biclique Kqg,q as a subgraph. 


2.16 (2). A graph G is chordal if it contains no induced cycles of length more than 3, that 
is, every cycle of length at least 4 has a chord. In the CHORDAL COMPLETION problem, we 
are given an undirected graph G and a positive integer k, and the objective is to decide 
whether we can add at most k edges to G so that it becomes a chordal graph. Obtain a 
polynomial kernel for CHORDAL COMPLETION (parameterized by k). 


2.17 (È). In the Epee Dissoint CYCLE PACKING problem, we are given an undirected 
graph G and a positive integer k, and the objective is to test whether G has k pairwise 
edge disjoint cycles. Obtain a polynomial kernel for EDGE DisjoInT CYCLE PACKING 
(parameterized by k). 


2.18 (2). A bisection of a graph G with an even number of vertices is a partition of V (G) 
into Vj and V2 such that |Vi| = |V2|. The size of (Vi, V2) is the number of edges with one 
endpoint in Vj and the other in V2. In the Maximum BIsEcTION problem, we are given 
an undirected graph G with an even number of vertices and a positive integer k, and the 
objective is to test whether there exists a bisection of size at least k. 


1. Show that every graph with m edges has a bisection of size at least [J]. Use this to 
show that Maximum BISECTION admits a kernel with 2k edges. 

2. Consider the following “above guarantee" variant of Maximum BISECTION, where we 
are given an undirected graph G and a positive integer k, but the objective is to test 
whether there exists a bisection of size at least [-]+k. Show that the problem admits 
a kernel with O(k?) vertices and O(k*) edges. 


2.19 (&). Byteland, a country of area exactly n square miles, has been divided by the 
government into n regions, each of area exactly one square mile. Meanwhile, the army of 
Byteland divided its area into n military zones, each of area again exactly one square mile. 
Show that one can build n airports in Byteland, such that each region and each military 
zone contains one airport. 


2.20. A magician and his assistant are performing the following magic trick. A volunteer 
from the audience picks five cards from a standard deck of 52 cards and then passes the 
deck to the assistant. The assistant shows to the magician, one by one in some order, four 
cards from the chosen set of five cards. Then, the magician guesses the remaining fifth 
card. Show that this magic trick can be performed without any help of magic. 
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2.21. Prove the second claim of Theorem [2.13 


2.22. In the DUAL-CoLoRING problem, we are given an undirected graph G on n vertices 
and a positive integer k, and the objective is to test whether there exists a proper coloring 
of G with at most n — k colors. Obtain a kernel with O(k) vertices for this problem using 
crown decomposition. 


2.23 (2). In the Max-INTERNAL SPANNING TREE problem, we are given an undirected 
graph G and a positive integer k, and the objective is to test whether there exists a spanning 
tree with at least k internal vertices. Obtain a kernel with O(k) vertices for MAx-INTERNAL 
SPANNING TREE. 


2.24 (Z). Let G be an undirected graph, let (av),ey(q) be any feasible solution to 
LPVC(G), and let M be a matching in G. Prove that |M| < vev(a) Ly: 


2.25 (%). Let G be a graph and let (tv)yev(g) be an optimum solution to LPVC(G) 
(not necessarily a half-integral one). Define a vector (yv)vev (c) as follows: 


if ty < 
i ay = 
if ty > 


Yv = 


= ne © 
NIP NIH NI = 


Show that (yv)vev (cG) is also an optimum solution to LPVC(G). 


2.26 (2). In the Mrn-Ones-2-SAT, we are given a CNF formula, where every clause has 
exactly two literals, and an integer k, and the goal is to check if there exists a satisfying 
assignment of the input formula with at most k variables set to true. Show a kernel for 
this problem with at most 2k variables. 


2.27 (Æ). Consider a restriction of d-Hirtina SET, called Ed-Hirtine SET, where we 
require every set in the input family A to be of size exactly d. Show that this problem 
is not easier than the original d-HITTING SET problem, by showing how to transform a 
d-HitTine SET instance into an equivalent Ed-Hittine SET instance without changing 
the number of sets. 


2.28. Show a kernel with at most f(d)k% sets (for some computable function f) for the 
Ed-Hittine SET problem, defined in the previous exercise. 


2.29. In the d-SreT Pacxkine problem, we are given a family A of sets over a universe 
U, where each set in the family has cardinality at most d, and a positive integer k. The 
objective is to decide whether there are sets S1,...,S, E A that are pairwise disjoint. 
Use the sunflower lemma to obtain a kernel for d-Ser Packine with f(d)k@ sets, for some 
computable function d. 


2.30. Consider a restriction of d-S—ET PACKING, called Ed-SetT Packina, where we require 
every set in the input family A to be of size exactly d. Show that this problem is not easier 
than the original d-SeT Packine problem, by showing how to transform a d-SET PACKING 
instance into an equivalent Ed-SrT PackiNG instance without changing the number of sets. 


2.31. A split graph is a graph in which the vertices can be partitioned into a clique and 
an independent set. In the VERTEX DisJoinT PATHS problem, we are given an undirected 
graph G and k pairs of vertices (s;, ti), i € {1,...,k}, and the objective is to decide whether 
there exists paths P; joining s; to t; such that these paths are pairwise vertex disjoint. 
Show that VERTEX Disjoint Partus admits a polynomial kernel on split graphs (when 
parameterized by k). 
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2.32. Consider now the VERTEX DisJoInT PATHS problem, defined in the previous exercise, 
restricted, for a fixed integer d > 3, to a class of graphs that does not contain a d-vertex 
path as an induced subgraph. Show that in this class the VERTEX DisJoinT PATHS problem 
admits a kernel with O(k?—+) vertices and edges. 


2.33. In the CLUSTER VERTEX DELETION problem, we are given as input a graph G and 
a positive integer k, and the objective is to check whether there exists a set S C V(G) of 
size at most k such that G—S is a cluster graph. Show a kernel for CLUSTER VERTEX 
DELETION with O(k3) vertices. 


2.34. An undirected graph G is called perfect if for every induced subgraph H of G, the 
size of the largest clique in H is same as the chromatic number of H. In the Opp CYCLE 
TRANSVERSAL problem, we are given an undirected graph G and a positive integer k, 
and the objective is to find at most k vertices whose removal makes the resulting graph 
bipartite. Obtain a kernel with O(k?) vertices for Opp CYCLE TRANSVERSAL on perfect 
graphs. 


2.35. In the SPLIT VERTEX DELETION problem, we are given an undirected graph G and 
a positive integer k and the objective is to test whether there exists a set S C V(G) of size 
at most k such that G — S is a split graph (see Exercise for the definition). 


1. Show that a graph is split if and only if it has no induced subgraph isomorphic to 
one of the following three graphs: a cycle on four or five vertices, or a pair of disjoint 
edges. 

2. Give a kernel with O(k*) vertices for SPLIT VERTEX DELETION. 


2.36 (8). In the SPLIT EDGE DELETION problem, we are given an undirected graph G 
and a positive integer k, and the objective is to test whether G can be transformed into 
a split graph by deleting at most k edges. Obtain a polynomial kernel for this problem 
(parameterized by k). 


2.37 (8). In the RamseEY problem, we are given as input a graph G and an integer k, and 
the objective is to test whether there exists in G an independent set or a clique of size at 
least k. Show that Ramsey is FPT. 


2.38 (®,). A directed graph D is called oriented if there is no directed cycle of length at 
most 2. Show that the problem of testing whether an oriented digraph contains an induced 
directed acyclic subgraph on at least k vertices is FPT. 


Hints 


Consider the following reduction rule: if there exists a line that contains more than k 
input points, delete the points on this line and decrease k by 1. 


[2.5] Consider the following natural reduction rules: 


1. delete a vertex that is not a part of any P3 (induced path on three vertices); 
2. if an edge wv is contained in at least k + 1 different P3s, then delete uv; 
3. if a non-edge wv is contained in at least k + 1 different P3s, then add wv. 


Show that, after exhaustive application of these rules, a yes-instance has O(k?) vertices. 


First, observe that one can discard any set in F that is of size at most 1. Second, 
observe that if every set in F is of size at least 2, then a random coloring of U has at least 
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|F|/2 nonmonochromatic sets on average, and an instance with |F| > 2k is a yes-instance. 
Moreover, observe that if we are dealing with a yes-instance and F € F is of size at least 
2k, then we can always tweak the solution coloring to color F nonmonochromatically: fix 
two differently colored vertices for k — 1 nonmonochromatic sets in the solution, and color 
some two uncolored vertices of F with different colors. Use this observation to design a 
reduction rule that handles large sets in F. 


Observe that the endpoints of the matching in question form a vertex cover of the 
input graph. In particular, every vertex of degree larger than 2k needs to be an endpoint 
of a solution matching. Let X be the set of these large-degree vertices. Argue, similarly as 
in the case of O(k?) kernel for VERTEX Cover, that in a yes-instance, G \ X has only few 
edges. Design a reduction rule to reduce the number of isolated vertices of G \ X. 


Proceed similarly as in the O(k?) kernel for VERTEX Cover. 


Proceed similarly as in the case of VERTEX Cover. Argue that the vertices of degree 
larger than d + k need to be included in the solution. Moreover, observe that you may 
delete isolated vertices, as well as edges connecting two vertices of degree at most d. Argue 
that, if no rule is applicable, then a yes-instance is of size bounded polynomially in d+ k. 


The important observation is that a matching of size k is a good subgraph. Hence, 
we may restrict ourselves to the case where we are additionally given a vertex cover X of 
the input graph of size at most 2k. Moreover, assume that X is inclusion-wise minimal. To 
conclude, prove that, if a vertex v € X has at least k neighbors in V(G) \ X, then (G, k) 
is a yes-instance. 


The main observation is that, since there is no 3-cycle nor 4-cycle in the graph, if 
x,y E N(v), then only v can dominate both x and y at once. In particular, every vertex of 
degree larger than k needs to be included in the solution. 

However, you cannot easily delete such a vertex. Instead, mark it as “obligatory” and 
mark its neighbors as “dominated”. Note now that you can delete a “dominated” vertex, 
as long as it has no unmarked neighbor and its deletion does not drop the degree of an 
“obligatory” vertex to k. 

Prove that, in a yes-instance, if no rule is applicable, then the size is bounded polyno- 
mially in k. To this end, show that 


1. any vertex can dominate at most k unmarked vertices, and, consequently, there are 
at most k? unmarked vertices; 

2. there are at most k “obligatory” vertices; 

3. every remaining “dominated” vertex can be charged to one unmarked or obligatory 
vertex in a manner that each unmarked or obligatory vertex is charged at most k + 1 
times. 


Let (G, k) be a FEEDBACK VERTEX SET instance and assume G is d-regular. If d < 2, 
then solve (G,k) in polynomial time. Otherwise, observe that G has dn/2 edges and, if 
(G, k) is a yes-instance and X is a feedback vertex set of G of size at most k, then at most 
dk edges of G are incident to X and G — X contains less than n — k edges (since it is a 
forest). Consequently, dn/2 < dk +n — k, which gives n = O(k) for d > 3. 


Show, using greedy arguments, that if every vertex in a digraph G has indegree at 
least d, then G contains d pairwise edge-disjoint cycles. 

For the vertex-deletion variant, design a simple reduction that boosts up the indegree 
of every vertex without actually changing anything in the solution space. 


Let X be the set of vertices of G of degree larger than k. Clearly, any connected 
vertex cover of G of size at most k needs to contain X. Moreover, as in the case of VERTEX 
Cover, in a yes-instance there are only O(k?) edges in G — X. However, we cannot easily 
discard the isolated vertices of G— X, as they may be used to make the solution connected. 
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To obtain an exponential kernel, note that in a yes-instance, |X| < k, and if we have 
two vertices u,v that are isolated in G — X, and Ng(u) = Ne(v) (note that Ng(u) C X 
for every u that is isolated in G — X), then we need only one of the vertices u,v in a 
CONNECTED VERTEX COVER solution. Hence, in a kernel, we need to keep: 


1. G[X], and all edges and non-isolated vertices of G — X; 

2. for every x € X, some k + 1 neighbors of x; 

3. for every Y C X, one vertex u that is isolated in G — X and Ng(u) = Y (if there 
exists any such vertex). 


For the last part of the exercise, note that in the presence of this assumption, no two 
vertices of X share more than one neighbor and, consequently, there are only O(|X|?) sets 
Y C X for which there exist u ¢ X with NG(u) =Y. 


We repeat the argument of the previous exercise, and bound the number of sets 
Y C X for which we need to keep a vertex u € V(G) \ X with Ng(u) = Y. First, there 
are O(d|X|?—!) sets Y of size smaller than d. Second, charge every set Y of size at least d 
to one of its subset of size d. Since G does not contain Kg,q as a subgraph, every subset X 
of size d is charged less than d times. Consequently, there are at most (d— 1) (1X1) vertices 
u € V(G) \ X such that Nge(u) C X and |NgG(u)| > d. 


The main observation is as follows: an induced cycle of length £ needs exactly £ — 3 
edges to become chordal. In particular, if a graph contains an induced cycle of length 
larger than k + 3, then the input instance is a no-instance, as we need more than k edges 
to triangulate the cycle in question. 

First, prove the safeness of the following two reduction rules: 


1. Delete any vertex that is not contained in any induced cycle in G. 

2. A vertex x is a friend of a non-edge wv, if u, x, v are three consecutive vertices of some 
induced cycle in G. If uv ¢ E(G) has more than 2k friends, then add the edge uv and 
decrease k by one. 


Second, consider the following procedure. Initiate A to be the vertex set of any inclusion- 
wise maximal family of pairwise vertex-disjoint induced cycles of length at most 4 in G. 
Then, as long as there exists an induced cycle of length at most 4 in G that contains two 
consecutive vertices in V(G) \ A, move these two vertices to A. Show, using a charging 
argument, that, in a yes-instance, the size of A remains O(k). Conclude that the size of a 
reduced yes-instance is bounded polynomially in k. 


Design reduction rules that remove vertices of degree at most 2 (you may obtain a 
multigraph in the process). Prove that every n-vertex multigraph of minimum degree at 
least 3 has a cycle of length O(log n). Use this to show a greedy argument that an n-vertex 
multigraph of minimum degree 3 has (n°) pairwise edge-disjoint cycles for some € > 0. 


Consider the following argument. Let |V(G)| = 2n and pair the vertices of G arbi- 
trarily: V(G) = {x1, Y1, £2, Y2,- --, £n, yn}. Consider the bisection (Vi, V2) where, in each 
pair (xi, yi), one vertex goes to Vi and the other goes to V2, where the decision is made 
uniformly at random and independently of other pairs. Prove that, in expectation, the ob- 
tained bisection is of size at least (m + £)/2, where £ is the number of pairs (x;, yi) where 
Lyi E E(G). 

Use the arguments in the previous paragraph to show not only the first point of the 
exercise, but also that the input instance is a yes-instance if it admits a matching of size 
2k. If this is not the case, then let X be the set of endpoints of a maximal matching in G; 
note that |X| < 4k. 

First, using a variation of the argument of the first paragraph, prove that, if there exists 
x E€ X that has at least 2k neighbors and at least 2k non-neighbors outside X, then the 
input instance is a yes-instance. Second, show that in the absence of such a vertex, all but 
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O(k?) vertices of V(G) \ X have exactly the same neighborhood in X, and design a way 
to reduce them. 


Construct the following bipartite graph: on one side there are regions of Byteland, on 
the second side there are military zones, and a region R is adjacent to a zone Z if ROZ #0. 
Show that this graph satisfies the condition of Hall’s theorem and, consequently, contains 
a perfect matching. 


5 
ossibly chosen by the volunteer), and on the other side there are all 52-51-50- 49 tuples 


of pairwise different four cards (possibly shown by the assistant). A set S is adjacent to a 
tuple T if all cards of T belong to S. Using Hall’s theorem, show that this graph admits a 
matching saturating the side with all sets of five cards. This matching induces a strategy 
for the assistant and the magician. 

We now show a relatively simple explicit strategy, so that you can impress your friends 
and perform this trick at some party. In every set of five cards, there are two cards of the 
same color, say a and b. Moreover, as there are 13 cards of the same color, the cards a and 
b differ by at most 6, that is, a+i = b or b+i = a for some 1 < i < 6, assuming some cyclic 
order on the cards of the same color. Without loss of generality, assume a+ i = b. The 
assistant first shows the card a to the magician. Then, using the remaining three cards, and 
some fixed total order on the whole deck of cards, the assistant shows the integer i (there 
are 3! = 6 permutations of remaining three cards). Consequently, the magician knows the 
card b by knowing its color (the same as the first card show by the assistant) and the value 
of the card a and the number i. 


Consider the following bipartite graph: on one side there are all C) sets of five cards 
p 


Let M be a maximum matching, which you can find using the Hopcroft-Karp algo- 
rithm (the first part of Theorem B-13). If M saturates Vi, then we are done. Otherwise, 
pick any v € Vi \V(M) (i.e., a vertex v € Vi that is not an endpoint of an edge of M) and 
consider all vertices of G that are reachable from v using alternating paths. (A path P is 
alternating if every second edge of P belongs to M.) Show that all vertices from Vi that 
are reachable from v using alternating paths form an inclusion-wise minimal set X with 
IN(X)| < |X]. 

[2.22] Apply the crown lemma to G, the edge complement of G (G has vertex set V(G) 
and wv € E(G) if and only if uv ¢ E(G)) and the parameter k — 1. If it returns a matching 
Mo of size k, then note that one can color the endpoints of each edge of Mo with the same 
color, obtaining a coloring of G with n—k colors. Otherwise, design a way to greedily color 
the head and the crown of the obtained crown decomposition. 


Your main tool is the following variation of the crown lemma: if V(G) is sufficiently 
large, then you can find either a matching of size k + 1, or a crown decomposition V(G) = 
C U HU R, such that G[H UC] admits a spanning tree where all vertices of H and |H|—1 
vertices of C are of degree at least two. Prove it, and use it for the problem in question. 


Observe that for every uv € M we have x, +2, > 1 and, moreover, all these 
inequalities for all edges of M contain different variables. In other words, 


5 w(xy) > 5 w(ty) = 5 (w(£v) + w(zu)) > 5 1= |M]. 


vEV (G) vEV(M) vueM vueM 


= Let Vs = {v E€ V(G) : 0 < ay < Ł} and Vis = {v E€ V(G) : 4 < ay < 1}. 
or sufficiently small € > 0, consider two operations: first, an operation of adding € to all 
variables x, for v € Vs and subtracting ¢ from z£, for v € Vj_5, and second, an operation of 
adding £ to all variables x, for v € Vij_5 and subtracting ¢ from x, for v € Vs. Show that 
both these operations lead to feasible solutions to LPVC(G), as long as £ is small enough. 
Conclude that |Vs| = |Vi_5|, and that both operations lead to other optimal solutions 
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to LPVC(G). Finally, observe that, by repeatedly applying the second operation, one can 
empty sets Vs and Vj_5, and reach the vector (yo)yev(a): 


First, design natural reduction rules that enforce the following: for every variable x 
in the input formula y, there exists a truth assignment satisfying y that sets x to false, 
and a truth assignment satisfying y that sets x to true. In other words, whenever a value 
of some variable is fixed in any satisfying assignment, fix this value and propagate it in the 
formula. 

Then, consider the following closure of the formula y: for every two-literal clause C that 
is satisfied in every truth assignment satisfying p, add C to y. Note that testing whether 
C is such a clause can be done in polynomial time: force two literals in C to be false and 
check if y remains satisfiable. Moreover, observe that the sets of satisfying assignments for 
y and y’ are equal. 

Let y’ be the closure of p. Consider the following auxiliary graph H: V(H) is the set of 
variables of y’, and xy € E(H) iff the clause xV y belongs to y’. Clearly, if we take any truth 
assignment w satisfying y, then 7—1(T) is a vertex cover of H. A somewhat surprising fact 
is that a partial converse is true: for every inclusion-wise minimal vertex cover X of H, 
the assignment 7 defined as y(x) = T if and only if x € X satisfies y’ (equivalently, vy). 
Note that such a claim would solve the exercise: we can apply the LP-based kernelization 
algorithm to VERTEX Cover instance (H, k), and translate the reductions it makes back 
to the formula g. 

Below we prove the aforementioned claim in full detail. We encourage you to try to 
prove it on your own before reading. 

Let X be a minimal vertex cover of H, and let y be defined as above. Take any clause 
C in y’ and consider three cases. If C = x V y, then xy € E(H), and, consequently, either 
x or y belongs to X. It follows from the definition of y that y(x) = T or y(y) = T, and 
w satisfies C. 

In the second case, C = x V ~y. For a contradiction, assume that ~ does not satisfy 
C and, consequently, x ¢ X and y € X. Since X is a minimal vertex cover, there exists 
z E€ Ny (y) such that z ¢ X and the clause C’ = y V z belongs to y’. If z = x, then any 
satisfying assignment to y’ sets y to true, a contradiction to our first preprocessing step. 
Otherwise, the presence of C and C’ implies that in any assignment 7’ satisfying y’ we 
have w'(x) = T or w’(z) = T. Thus, x V z is a clause of y’, and xz € E(H). However, 
neither x nor z belongs to X, a contradiction. 

In the last case, C = ~z V ~y and, again, we assume that ~ does not satisfy C, that is, 
x,y € X. Since X is a minimal vertex cover, there exist s E€ Ny(x), t E€ Ny (y) such that 
s,t ¢ X. It follows from the definition of H that the clauses Cs = x V s and Cy =yVt 
are present in y’. If s = t, then the clauses C, Cy and Cy imply that t is set to true in 
any truth assignment satisfying y’, a contradiction to our first preprocessing step. If s Æ t, 
then observe that the clauses C, Cz and Cy imply that either s or t is set to true in any 
truth assignment satisfying y’ and, consequently, s V t is a clause of y’ and st € E(H). 
However, s,t ¢ X, a contradiction. 





If Ø € A, then conclude that we are dealing with a no-instance. Otherwise, for every 
set X € A of size |X| < d, create d — |X| new elements and add them to X. 


There are two ways different ways to solve this exercise. First, you can treat the 
input instance as a d-HiTTING SET instance, proceed as in Section [2.6.1] and at the end 
apply the solution of Exercise [2.27]to the obtained kernel, in order to get an Ed-HITTING 
SET instance. 

In a second approach, try to find a sunflower with k + 2 sets, instead of k + 1 as in 
Section [2.6.1] If a sunflower is found, then discard one of the sets: the remaining k + 1 sets 
still ensure that the core needs to be hit in any solution of size at most k. 
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Show that in a (dk + 2)-sunflower, one set can be discarded without changing the 
answer to the problem. 


Proceed as in Exercise pad every set X € A with d—|X| newly created elements. 


2.31| Show that, in a split graph, every path can be shortened to a path on at most 
four vertices. Thus, for every 1 < i < k, we have a family F; of vertex sets of possible 
paths between s; and t;, and this family is of size O(n*). Interpret the problem as a d-SET 
PACKING instance for some constant d and family F = aa Fi. Run the kernelization 
algorithm from the previous exercise, and discard all vertices that are not contained in any 
set in the obtained kernel. 


[2.32] Show that, in a graph excluding a d-vertex path as an induced subgraph, every 
path in a solution can shortened to a path on at most d — 1 vertices. Proceed then as in 
Exercise [2.31 


[2.33] Let A be a family of all vertex sets of a Ps (induced path on three vertices) in G. In 
this manner, CLUSTER VERTEX DELETION becomes a 3-HiTTING SET problem on family 
A, as we need to hit all induced P3s in G. Reduce A, but not exactly as in the d-HITTING 
SET case: repeatedly find a k + 2 sunflower and delete one of its elements from A. Show 
that this reduction is safe for CLUSTER VERTEX DELETION. Moreover, show that, if A’ 
is the family after the reduction is exhaustively applied, then (G[U A’], k) is the desired 
kernel. 


Use the following observation: a perfect graph is bipartite if and only if it does not 
contain a triangle. Thus, the problem reduces to hitting all triangles in the input graph, 
which is a 3-HITTING SET instance. 


[2.35] Proceed as in the case of CLUSTER VERTEX DELETION: interpret a SPLIT VERTEX 
DELETION instance as a 5-HITTING SET instance. 


Let {C4,Cs5,2K2} be the set of forbidden induced subgraphs for split graphs. That 
is, a graph is a split graph if it contains none of these three graphs as an induced subgraph. 

You may need (some of) the following reduction rules. (Note that the safeness of some 
of them is not so easy.) 


1. The “standard” sunflower-like: if more than k forbidden induced subgraphs share a 
single edge (and otherwise are pairwise edge-disjoint), delete the edge in question. 

2. The irrelevant vertex rule: if a vertex is not part of any forbidden induced subgraph, 
then delete the vertex in question. (Safeness is not obvious here!) 

3. If two adjacent edges uv and uw are contained in more than k induced C4s, then 
delete uv and uw, and replace them with edges va and wb, where a and b are new 
degree-1 vertices. 

4. If two adjacent edges uv and uw are contained in more than k pairwise edge-disjoint 
induced Css, then delete uv and ww, and decrease k by 2. 

5. If the edges v1v2, v2v3 and v3v4 are contained in more than k induced Css, delete 
v2v3 and decrease k by 1. 


By induction, show that every graph on at least 4” vertices has either a clique or 
an independent set on k vertices. Observe that this implies a kernel of exponential size for 
the problem. 


Show that every tournament on n vertices has a transitive subtournament on O(log n) 
vertices. Then, use this fact to show that every oriented directed graph on n vertices has an 
induced directed acyclic subgraph on logn vertices. Finally, obtain an exponential kernel 
for the considered problem. 
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Chapter 3 
Bounded search trees 


In this chapter we introduce a variant of exhaustive 
search, namely the method of bounded search trees. 
This is one of the most commonly used tools in the de- 
sign of fixed-parameter algorithms. We illustrate this 
technique with algorithms for two different parameter- 
izations of VERTEX COVER, as well as for the prob- 
lems (undirected) FEEDBACK VERTEX SET and CLos- 
EST STRING. 





Bounded search trees, or simply branching, is one of the simplest and most 
commonly used techniques in parameterized complexity that originates in the 
general idea of backtracking. The algorithm tries to build a feasible solution to 
the problem by making a sequence of decisions on its shape, such as whether 
to include some vertex into the solution or not. Whenever considering one 
such step, the algorithm investigates many possibilities for the decision, thus 
effectively branching into a number of subproblems that are solved one by 
one. In this manner the execution of a branching algorithm can be viewed 
as a search tree, which is traversed by the algorithm up to the point when a 
solution is discovered in one of the leaves. In order to justify the correctness of 
a branching algorithm, one needs to argue that in case of a yes-instance some 
sequence of decisions captured by the algorithm leads to a feasible solution. 
If the total size of the search tree is bounded by a function of the parameter 
alone, and every step takes polynomial time, then such a branching algorithm 
runs in FPT time. This is indeed the case for many natural backtracking 
algorithms. 

More precisely, let J be an instance of a minimization problem (such as 
VERTEX COVER). We associate a measure ju(I) with the instance J, which, in 
the case of FPT algorithms, is usually a function of k alone. In a branch step 
we generate from J simpler instances I1,..., Ip (£ > 2) of the same problem 
such that the following hold. 
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1. Every feasible solution S$ of I;, i € {1,...,€}, corresponds to a feasible 
solution h;(S) of I. Moreover, the set 


{ni(S) : 1<7i< and S is a feasible solution of I 


contains at least one optimum solution for J. Informally speaking, a 
branch step splits problem J into subproblems J,,...,J¢, possibly tak- 
ing some (formally justified) greedy decisions. 
2. The number @ is small, e.g., it is bounded by a function of (JL) alone. 
3. Furthermore, for every J;, i € {1,...,¢}, we have that w(;) < (I) — c 
for some constant c > 0. In other words, in every branch we substantially 
simplify the instance at hand. 


In a branching algorithm, we recursively apply branching steps to instances 
I, I2,...,I¢, until they become simple or even trivial. Thus, we may see an 
execution of the algorithm as a search tree, where each recursive call cor- 
responds to a node: the calls on instances J, [o,...,J¢ are children of the 
call on instance I. The second and third conditions allow us to bound the 
number of nodes in this search tree, assuming that the instances with non- 
positive measure are simple. Indeed, the third condition allows us to bound 
the depth of the search tree in terms of the measure of the original instance, 
while the second condition controls the number of branches below every node. 
Because of these properties, search trees of this kind are often called bounded 
search trees. A branching algorithm with a cleverly chosen branching step 
often offers a drastic improvement over a straightforward exhaustive search. 

We now present a typical scheme of applying the idea of bounded search 
trees in the design of parameterized algorithms. We first identify, in polyno- 
mial time, a small (typically of size that is constant, or bounded by a function 
of the parameter) subset S of elements of which at least one must be in some 
or every feasible solution of the problem. Then we solve |S] subproblems: 
for each element e of S, create one subproblem in which we include e in the 
solution, and solve the remaining task with a reduced parameter value. We 
also say that we branch on the element of S that belongs to the solution. 
Such search trees are analyzed by measuring the drop of the parameter in 
each branch. If we ensure that the parameter (or some measure bounded by 
a function of the parameter) decreases in each branch by at least a constant 
value, then we will be able to bound the depth of the search tree by a function 
of the parameter, which results in an FPT algorithm. 

It is often convenient to think of branching as of “guessing” the right 
branch. That is, whenever performing a branching step, the algorithm guesses 
the right part of an (unknown) solution in the graph, by trying all possibili- 
ties. What we need to ensure is that there will be a sequence of guesses that 
uncovers the whole solution, and that the total time spent on wrong guesses 
is not too large. 

We apply the idea of bounded search trees to VERTEX COVER in Sec- 
tion Section briefly discusses methods of bounding the number of 
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nodes of a search tree. In Section we give a branching algorithm for 
FEEDBACK VERTEX SET in undirected graphs. Section [3.4] presents an al- 
gorithm for a different parameterization of VERTEX COVER and shows how 
this algorithm implies algorithms for other parameterized problems such as 
ODD CYCLE TRANSVERSAL and ALMOST 2-SAT. Finally, in Section we 
apply this technique to a non-graph problem, namely CLOSEST STRING. 


3.1 VERTEX COVER 


As the first example of branching, we use the strategy on VERTEX COVER. 
In Chapter [] (Lemma|2.23), we gave a kernelization algorithm which in time 
O(n/m) constructs a kernel on at most 2k vertices. Kernelization can be 
easily combined with a brute-force algorithm to solve VERTEX COVER in 
time O(n/m+4"kO). Indeed, there are at most 2?¥ = 4* subsets of size at 
most k in a 2k-vertex graph. Thus, by enumerating all vertex subsets of size at 
most k in the kernel and checking whether any of these subsets forms a vertex 
cover, we can solve the problem in time O(n/m + 4*k°)). We can easily 
obtain a better algorithm by branching. Actually, this algorithm was already 
presented in Chapter |1| under the cover of the BAR FIGHT PREVENTION 
problem. 

Let (G,k) be a VERTEX COVER instance. Our algorithm is based on the 
following two simple observations. 


e For a vertex v, any vertex cover must contain either v or all of its 
neighbors N(v). 

e VERTEX COVER becomes trivial (in particular, can be solved opti- 
mally in polynomial time) when the maximum degree of a graph is 
at most 1. 


We now describe our recursive branching algorithm. Given an instance 
(G,k), we first find a vertex v € V(G) of maximum degree in G. If v is of 
degree 1, then every connected component of G is an isolated vertex or an 
edge, and the instance has a trivial solution. Otherwise, |N(v)| > 2 and we 
recursively branch on two cases by considering 


either v, or N(v) in the vertex cover. 


In the branch where v is in the vertex cover, we can delete v and reduce 
the parameter by 1. In the second branch, we add N(v) to the vertex cover, 
delete N[v] from the graph and decrease k by |N(v)| > 2. 

The running time of the algorithm is bounded by 
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(the number of nodes in the search tree) x (time taken at each node). 


Clearly, the time taken at each node is bounded by n°“). Thus, if 7(k) is the 
number of nodes in the search tree, then the total time used by the algorithm 
is at most T(k)nO, 


In fact, in every search tree 7 that corresponds to a run of a branching 
algorithm, every internal node of 7 has at least two children. Thus, if 
T has £ leaves, then the number of nodes in the search tree is at most 
2¢ — 1. Hence, to bound the running time of a branching algorithm, it 
is sufficient to bound the number of leaves in the corresponding search 
tree. 


In our case, the tree 7 is the search tree of the algorithm when run with 
parameter k. Below its root, it has two subtrees: one for the same algorithm 
run with parameter k — 1, and one recursive call with parameter at most 
k — 2. The same pattern occurs deeper in 7. This means that if we define a 
function T(k) using the recursive formula 


~ fTGü-1)+T(i-2) ifi>2, 
TEA { 1 otherwise, 


then the number of leaves of T is bounded by T(k). 
Using induction on k, we prove that T(k) is bounded by 1.6181*. Clearly, 
this is true for k = 0 and k = 1, so let us proceed for k > 2: 


T(k) =T(k—1) +T(k—2) < 1.6181"! + 1.618147? 
< 1.6181*~?(1.6181 + 1) < 1.6181*~?(1.6181)? < 1.6181". 


This proves that the number of leaves is bounded by 1.6181". Combined with 
kernelization, we arrive at an algorithm solving VERTEX COVER in time 
O(nym + 1.6181" kOW), 

A natural question is how did we know that 1.6181" is a solution to the 
above recurrence. Suppose that we are looking for an upper bound on function 
T(k) of the form T(k) < c-A*, where c > 0, \ > 1 are some constants. Clearly, 
we can set constant c so that the initial conditions in the definition of T'(k) 
are satisfied. Then, we are left with proving, using induction, that this bound 
holds for every k. This boils down to proving that 


cv AF > e+ AFTI tee), (3.1) 
since then we will have 


T(k) =T(k—1)+T(k—2)<e¢- 14-7 <e-d. 
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Observe that (3.1) is equivalent to A? > A + 1, so it makes sense to look 
for the lowest possible value of A for which this inequality is satisfied; this is 
actually the one for which equality holds. By solving equation A? = A + 1 for 
A > 1, we find that À = tv5 < 1.6181, so for this value of A the inductive 
proof works. 

The running time of the above algorithm can be easily improved using the 
following argument, whose proof we leave as Exercise 


Proposition 3.1. VERTEX COVER can be solved optimally in polynomial 
time when the maximum degree of a graph is at most 2. 


Thus, we branch only on the vertices of degree at least 3, which immediately 
brings us to the following upper bound on the number of leaves in a search 


tree: 
_ [T(k-1)+T(k-3) ifk>3, 
ah { 1 otherwise. 


Again, an upper bound of the form c- \* for the above recursive function can 
be obtained by finding the largest root of the polynomial equation A? = \?+1. 
Using standard mathematical techniques (and/or symbolic algebra packages) 
the root is estimated to be at most 1.4656. Combined with kernelization, this 
gives us the following theorem. 


Theorem 3.2. VERTEX COVER can be solved in time O(ny/m+1.4656 kf? ®). 


Can we apply a similar strategy for graphs of vertex degree at most 3? 
Well, this becomes more complicated as VERTEX COVER is NP-hard on this 
class of graphs. But there are more involved branching strategies, and there 
are faster branching algorithms than the one given in Theorem |3.2 


3.2 How to solve recursive relations 


For algorithms based on the bounded search tree technique, we need to bound 
the number of nodes in the search tree to obtain an upper bound on the 
running time of the algorithm. For this, recurrence relations are used. The 
most common case in parameterized branching algorithms is when we use 
linear recurrences with constant coefficients. There exists a standard tech- 
nique to bound the number of nodes in the search tree for this case. If the 
algorithm solves a problem of size n with parameter k and calls itself re- 
cursively on problems with decreased parameters k — d1, k — dz,...,k — dp, 
then (d1, d2,...,dp) is called the branching vector of this recursion. For ex- 
ample, we used a branching vector (1,2) to obtain the first algorithm for 
VERTEX COVER in the previous section, and a branching vector (1,3) for the 
second one. For a branching vector (d4, d2,...,dp), the upper bound T(k) 
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on the number of leaves in the search tree is given by the following linear 
recurrence: 


T(k) = T(k— d1) +T(k — d2) +-+- +T(k — dp). 


Again, for k < d, where d = maxj=1,2,....» di, we put the initial condition 
T(k) = 1. Assuming that new subproblems with smaller parameters can 
be computed in time polynomial in n, the running time of such recursive 
algorithm is T(k) - n°. 

If we now look for an upper bound of the form T(k) < c- A*, then the 
inductive step boils down to proving the following inequality: 


Ae SA a GA Boe A e, (3.2) 


Inequality (3.2) can be rewritten as P(A) > 0, where 





P(A) = ve h dd-da see’ d\d-4p (3.3) 


is the characteristic polynomial of the recurrence for T(k) (recall d = 
max;=1,2,...,» di). Using standard techniques of calculus it is not hard to show 
that if a polynomial P has a form as in (3.3), then P has a unique positive 
root Ap, and moreover P(A) < 0 for 0 < À < Ap and P(A) > 0 for A > Ao. 
This means that Ao is the best possible value that can be used in an upper 
bound for T(k). In the bibliographic notes we provide further references to 
the relevant literature on solving recursive formulas. 

The root Ao is often called the branching number corresponding to the 





branching vector (di, d2,...,dp). Hence, the running time of the considered 
branching algorithm is bounded by AKEn D, In Table [3.1] we give branching 
numbers corresponding to branching vectors (i, j) for i,j € {1,...,6}. 
Ga) 1 2 3 4 5 6 

1 |2.0000 1.6181 1.4656 1.3803 1.3248 1.2852 

2 1.4143 1.3248 1.2721 1.2366 1.2107 

3 1.2560 1.2208 1.1939 1.1740 

4 1.1893 1.1674 1.1510 

5 1.1487 1.1348 

6 1.1225 














Table 3.1: A table of branching numbers (rounded up) 


Two natural questions arise: 


e How good is the estimation of T (k) using the exponent of the correspond- 
ing branching number? 
e How well does T (k) estimate the actual size of the search tree? 
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The answer to the first question is “it is good”: up to a polynomial factor, 
the estimation is tight. The second question is much more difficult, since the 
actual way a branching procedure explores the search space may be more 
complex than our estimation of its behavior using recursive formulas. If, say, 
a branching algorithm uses several ways of branching into subproblems (so- 
called branching rules) that correspond to different branching vectors, and/or 
is combined with local reduction rules, then so far we do not know how 
to estimate the running time better than by using the branching number 
corresponding to the worst branching vector. However, the delicate interplay 
between different branching rules and reduction rules may lead to a much 
smaller tree than what follows from our imprecise estimations. 


3.3 FEEDBACK VERTEX SET 


For a given graph G and a set X C V(G), we say that X is a feedback vertex 
set of G if G—X is an acyclic graph (i.e., a forest). In the FEEDBACK VERTEX 
SET problem, we are given an undirected graph G and a nonnegative integer 
k, and the objective is to determine whether there exists a feedback vertex 
set of size at most k in G. In this section, we give a branching algorithm 
solving FEEDBACK VERTEX SET in time kO( . nO), 

It is more convenient for us to consider this problem in the more general 
setting of multigraphs, where the input graph G may contain multiple edges 
and loops. We note that both a double edge and a loop are cycles. We also 
use the convention that a loop at a vertex v contributes 2 to the degree of v. 

We start with some simple reduction rules that clean up the graph. At 
any point, we use the lowest-numbered applicable rule. We first deal with 
the multigraph parts of G. Observe that any vertex with a loop needs to be 
contained in any solution set X. 


Reduction FVS.1. If there is a loop at a vertex v, delete v from the graph 
and decrease k by 1. 


Moreover, notice that the multiplicity of a multiple edge does not influence 
the set of feasible solutions to the instance (G, k). 


Reduction FVS.2. If there is an edge of multiplicity larger than 2, reduce 
its multiplicity to 2. 


We now reduce vertices of low degree. Any vertex of degree at most 1 does 
not participate in any cycle in G, so it can be deleted. 


Reduction FVS.3. If there is a vertex v of degree at most 1, delete v. 


Concerning vertices of degree 2, observe that, instead of including into the 
solution any such vertex, we may as well include one of its neighbors. This 
leads us to the following reduction. 


58 3 Bounded search trees 


Reduction FVS.4. If there is a vertex v of degree 2, delete v and connect 
its two neighbors by a new edge. 


Two remarks are in place. First, a vertex v in Reduction [FVS.4]cannot have 
a loop, as otherwise Reduction [FVS.1|should be triggered on v instead. This 
ensures that the neighbors of v are distinct from v, hence the rule may be ap- 
plied and the safeness argument works. Second, it is possible that v is incident 
to a double edge: in this case, the reduction rule deletes v and adds a loop 
to a sole “double” neighbor of v. Observe that in this case Reduction 
will trigger subsequently on this neighbor. 

We remark that after exhaustively applying these four reduction rules, the 
resulting graph G 


(P1) contains no loops, 
(P2) has only single and double edges, and 
(P3) has minimum vertex degree at least 3. 


Moreover, all rules are trivially applicable in polynomial time. From now 
on we assume that in the input instance (G,k), graph G satisfies properties 
(P1)-(P3). 

We remark that for the algorithm in this section, we do not need prop- 
erties (P1) and (P2). However, we will need these properties later for the 
kernelization algorithm in Section 

Finally, we need to add a rule that stops the algorithm if we already 
exceeded our budget. 


Reduction FVS.5. If k < 0, terminate the algorithm and conclude that 
(G, k) is a no-instance. 


The intuition behind the algorithm we are going to present is as follows. 
Observe that if X is a feedback vertex set of G, then G — X is a forest. 
However, G — X has at most |V(G)| — |X| — 1 edges and thus G — X 
cannot have “many” vertices of high degree. Thus, if we pick some f(k) 
vertices with the highest degrees in the graph, then every solution of 
size at most k must contain one of these high-degree vertices. In what 
follows we make this intuition work. 


Let (v1, V2,..., Un) be a descending ordering of V(G) according to vertex 
degrees, i.e., d(vi) > d(v2) > +++ > d(vp). Let V3, = {v1,..., U3}. Let us 
recall that the minimum vertex degree of G is at least 3. Our algorithm for 
FEEDBACK VERTEX SET is based on the following lemma. 


Lemma 3.3. Every feedback vertex set in G of size at most k contains at 
least one vertex of Vzķ- 


Proof. To prove this lemma we need the following simple claim. 
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Claim 3.4. For every feedback verter set X of G, 


S (aw) — 1) > |E@)| - |V(@)| +1. 


vEX 


Proof. Graph F = G — X is a forest and thus the number of edges in F is at 
most |V(G)| — |X| — 1. Every edge of E(G) \ E(F) is incident to a vertex of 
X. Hence 
Y d(v) + |V(@)| -|X|-1 > |E@). 
vEx 
al 


Targeting a contradiction, let us assume that there is a feedback vertex set 
X of size at most k such that X N V3, = Ø. By the choice of Vz, for every 
v € X, d(v) is at most the minimum of vertex degrees from V3,. Because 
|X| < k, by Claim 3.4] we have that 


3k 


Yd) - 1) = 3- (X a) -1)) =3-(E@|-V@|+. 


i=l vEX 


In addition, we have that X C V (G) \ V3,, and hence 


X (avi) — 1) > $ (ae) - 1) = (EG) -IV (G) +1). 


i>3k vEX 


Therefore, 


> (dwi) = 1) > 4- (E(G)| - |V (G)| + 1). 


However, observe that S7i_, d(v:) = 2|E(G)|: every edge is counted twice, 
once for each of its endpoints. Thus we obtain 


4-(|E(G)| — |V(G@)| + 1) <u (vi) — 1) = 2|E(@)| — |V(G)], 


which implies that 2|E(G)| < 3|V(G)|. However, this contradicts the fact 
that every vertex of G is of degree at least 3. 














We use Lemma [3.3]to obtain the following algorithm for FEEDBACK VER- 
TEX SET. 


Theorem 3.5. There exists an algorithm for FEEDBACK VERTEX SET run- 
ning in time (3k)! -nO, 


Proof. Given an undirected graph G and an integer k > 0, the algorithm 


works as follows. It first applies Reductions |FVS.1| |FVS.2} |FVS.3] |F VS.4 
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and [FVS.5] exhaustively. As the result, we either already conclude that we 
are dealing with a no-instance, or obtain an equivalent instance (G’, k’) such 
that G” has minimum degree at least 3 and k’ < k. If G’ is empty, then we 
conclude that we are dealing with a yes-instance, as k’ > 0 and an empty set 
is a feasible solution. Otherwise, let V3z- be the set of 3k’ vertices of G” with 
largest degrees. By Lemma [3.3] every solution X to the FEEDBACK VERTEX 
SET instance (G’,k’) contains at least one vertex from V3,’. Therefore, we 
branch on the choice of one of these vertices, and for every vertex v € V3z., 
we recursively apply the algorithm to solve the FEEDBACK VERTEX SET 
instance (G’ — v, k’ — 1). If one of these branches returns a solution X’, then 
clearly X’ U {v} is a feedback vertex set of size at most k’ for G”. Else, we 
return that the given instance is a no-instance. 

At every recursive call we decrease the parameter by 1, and thus the height 
of the search tree does not exceed k’. At every step we branch in at most 3k’ 
subproblems. Hence the number of nodes in the search tree does not exceed 
(3k’)*’ < (3k)*. This concludes the proof. 














3.4 VERTEX COVER ABOVE LP 


Recall the integer linear programming formulation of VERTEX COVER and 
its relaxation LPVC(G): 


min seva) Ly 
subject to Zu + £y > 1 for every uv E€ E(G), 
O0<a,<1 for every v € V(G). 


These programs were discussed in Section If the minimum value of 
LPVC(G) is vc*(G), then the size of a minimum vertex cover is at least 
ve*(G). This leads to the following parameterization of VERTEX COVER, 
which we call VERTEX COVER ABOVE LP: Given a graph G and an integer 
k, we ask for a vertex cover of G of size at most k, but instead of seeking an 
FPT algorithm parameterized by k as for VERTEX COVER, the parameter 
now is k — vc*(G). In other words, the goal of this section is to design an 
algorithm for VERTEX COVER on an n-vertex graph G with running time 
f(k —ve*(G)) -n° for some computable function f. 

The parameterization by k — vc*(G) falls into a more general theme of 
above guarantee parameterization, where, instead of parameterizing purely 
by the solution size k, we look for some (computable in polynomial time) 
lower bound ¢ for the solution size, and use a more refined parameter k — £, 
the excess above the lower bound. Such a parameterization makes perfect 
sense in problems where the solution size is often quite large and, conse- 
quently, FPT algorithms in parameterization by k may not be very efficient. 
In the VERTEX COVER problem, the result of Section —a kernel with at 


3.4 VERTEX COVER ABOVE LP 61 


most 2k vertices — can be on one hand seen as a very good result, but on 
the other hand can be an evidence that the solution size parameterization for 
VERTEX COVER may not be the most meaningful one, as the parameter can 
be quite large. A more striking example is the MAXIMUM SATISFIABILITY 
problem, studied in Section [2.3.2 here an instance with at least 2k clauses is 
trivially a yes-instance. In Section [9.2] we study an above guarantee parame- 
terization of a variant of MAXIMUM SATISFIABILITY, namely MAx-Er-SAT. 
In Exercise [9.3] we also ask for an FPT algorithm for MAXIMUM SATISFIA- 
BILITY parameterized by k — m/2, where m is the number of clauses in the 
input formula. The goal of this section is to study the above guarantee pa- 
rameterization of VERTEX COVER, where the lower bound is the cost of an 
optimum solution to an LP relaxation. 


Before we describe the algorithm, we fix some notation. By optimum 
solution x = (Xv),ey(q) to LPVC(G), we mean a feasible solution with 
1 > x, > Ofor all v € V(G) that minimizes the objective function (sometimes 
called the cost) w(x) = vevi(G) £y. By Proposition ele any graph G 
there exists an optimum half-integral solution of LPVC(G), i.e., a solution 
with x, € {0,4,1} for all v € V(G), and such a solution can be found in 
polynomial time. 

Let vc(G) denote the size of a minimum vertex cover of G. Clearly, ve(G) > 
ve*(G). For a half-integral solution x = (y),cy(g andi € {0, 3, 1}, we define 
Vx ={v eV : ay =i}. We also say that x = {ry} yev(q) is all-5-solution if 


Ty = 4 for every v € V(G). Because the all-4-solution is a feasible solution, 


we have that vc*(G) < KO, Furthermore, we define the measure of an 
instance (G,k) to be our parameter of interest u(G, k) = k — vc* (G). 

Recall that in Section [2.5]we have developed Reduction [VC.4]for VERTEX 
Cover. This reduction, if restricted to half-integral solutions, can be stated 
as follows: for an optimum half-integral LP VC(G) solution x, we (a) conclude 
that the input instance (G,k) is a no-instance if w(x) > k; and (b) delete 
Vo U VĚ and decrease k by |Vř| otherwise. As we are now dealing with 


measure u(G,k) = k — vc* (G), we need to understand how this parameter 
changes under Reduction 


Lemma 3.6. Assume an instance (G’,k’) is created from an instance (G, k) 
by applying Reduction to a half-integral optimum solution x. Then 
ve*(G) — ve*(G’) = ve(G) — ve(G’) = |V| = k — k'. In particular, 
H(G", k’) = w(G,k). 

We remark that, using Exercise the statement of Lemma is true 
for any optimum solution x, not only a half-integral one (see Exercise [3.19). 


However, the proof for a half-integral solution is slightly simpler, and, thanks 
to Proposition we may work only with half-integral solutions. 


Proof (of Lemma [3.6). Observe that every edge of G incident to Vý has its 
second endpoint in V;*. Hence, we have the following: 
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e For every vertex cover Y of G’, Y U VÝ is a vertex cover of G and, conse- 
quently, ve(G) < ve(G’) + |VĚ|. 

e For every feasible solution y’ to LPVC(G’), if we define a vector y = 
(Yw)vev(G) aS Yv = Y, for every v E€ V(G") and yv = x, for every v € 
V(G) \ V(G’), then we obtain a feasible solution to LPVC(G) of cost 
w(y’) + |VE|; consequently, ve* (G) < ve*(G’) + |VĚI. 

Now it suffices to prove the reversed versions of the two inequalities obtained 

above. Theorem [2.19] ensures that ve(G’) < vce(G) — |V*|. Moreover, since x 

restricted to V(G’) is a feasible solution to LPVC(G’) of cost ve*(G) — |V**|, 

we have vc*(G’) < v (G) — |VĚI. 

In Theorem [2.21]we simply solved LPVC(G), and applied Reduction [VC.4] 
to obtain a kernel with at most 2k vertices. In this section we would like to do 
something slightly stronger: to apply Reduction [VC.4]as long as there exists 
some half-integral optimum solution x that is not the all-4-solution. Luckily, 
by a simple self-reduction trick, we can always detect such solutions. 














Lemma 3.7. Given a graph G, one can in O(mn3/?) time find an optimum 
solution to LPVC(G) which is not the all-4-solution, or correctly conclude 
that the all-4-solution is the unique optimum solution to LPVC(G). More- 
over, the returned optimum solution is half-integral. 


Proof. First, use Proposition [2.24] to solve LPVC(G), obtaining an optimum 
half-integral solution x. If x is not the all- 4-solution, then return x. Other- 
wise, proceed as follows. 

For every v € V(G), use Proposition [2.24] again to solve LPVC(G — v), 
obtaining an optimum half-integral solution x”. Define a vector x"’° as x”, ex- 
tended with a value x, = 1. Note that x”? is a feasible solution to LPVC(G) 
of cost w(x”) = w(x”) + 1 = vc*(G — v) + 1. Thus, if for some v € V(G) 
we have w(x”) = ve*(G—v) < vc*(G) — 1, then x”° is an optimum solu- 
tion for LPVC(G). Moreover, x™° is half-integral, but is not equal to the 
all- 3-solution due to the value at vertex v. Hence, we may return x”’°. 

We are left with the case 


vc*(G — v) > ve*(G)—1 for every v € V(G). (3.4) 


We claim that in this case the all- 4-solution is the unique solution to 
LPVC(G); note that such a claim would conclude the proof of the lemma, as 
the computation time used so far is bounded by O(mn?/?) (n+1 applications 
of Proposition [2.24). Observe that, due to Proposition [2.24] both 2 vc*(G—v) 
and 2 vc*(G) are integers and, consequently, we obtain the following strength- 


ening of (3.4): 
1 
ve*(G — v) > ve*(G) — 3 for every v € V (G). (3.5) 


By contradiction, let x be an optimum solution to LPVC(G) that is not 
the all-4-solution. As the all-4-solution is an optimum solution to LPVC(G), 
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for some v € V(G) we have x, > 4. However, then x, restricted to G—v, is a 
feasible solution to LPVC(G—v) of cost w(x)—ay = ve*(G)—2, < ve*(G)—-4, 
a contradiction to (3.5). 


Lemma [3.7|allows us to enhance Reduction to the following form. 


Reduction VC.5. Invoke the algorithm of Lemma [8.7]and, if an optimum 
solution x is returned, then apply Reduction to G and solution x. 














If Reduction [VC.5]is not applicable, then we know not only that the all- l- 
solution is an optimum solution to LPVC(G), but also that this is the unique 
optimum solution. This property is crucial for our branching algorithm, en- 
capsulated in the next theorem. 


Theorem 3.8. There exists an algorithm solving VERTEX COVER ABOVE 
LP in time 4'-ve"(@) . nOQ), 


Proof. Our algorithm for VERTEX COVER ABOVE LP is almost the same as 
the branching algorithm described for VERTEX COVER in Section [3.1] After 
Reduction is applied exhaustively, we pick an arbitrary vertex v in 
the graph and branch on it. In other words, in one branch, we add v into 
the vertex cover, decrease k by 1, and delete v from the graph, and in the 
other branch, we add N(v) into the vertex cover, decrease k by |N(v)|, and 
delete N|v] from the graph. The correctness of this algorithm follows from 
the safeness of Reduction[VC.5|and the fact that the branching is exhaustive. 

Although the algorithm is very similar to the one of Section [3.1] we analyze 
our algorithm in a completely different way, using the measure u(G, k) = k— 
ve*(G). In Lemma 3.6] we have proved that Reduction [VC.4]does not change 
the measure; as Reduction [VC.5]is in fact an application of Reduction [VC.4] 
to a specific half-integral solution x, the same holds for Reduction 
Hence, it remains to analyze how the measure changes in a branching step. 

Consider first the case when we pick v to the vertex cover, obtaining an 
instance (G’, k’) = (G—v, k—1). We claim that ve*(G’) > ve*(G)—4. Suppose 
that this is not the case. Let x’ be an optimum solution to LPVC(G’). We 
have w(x’) < vc*(G) — 1 and we can obtain an optimum solution x” to 
LPVC(G) by extending x’ with a new variable x, = 1 corresponding to v. 
But this contradicts our assumption that the all-$-solution is the unique 
optimum solution to LPVC(G). Hence, ve*(G’) > ve*(G) — 4, which implies 
that u(G’,k’) < u(G, k) — 5. 

In the second case, let p = |N(v)|. Recall that here the new instance is 
(G',k’) = (G — Nv], k — p). We claim that ve*(G’) > ve*(G) — p + 4, which 
would imply again (G’, k’) < u(G,k) — Z. Assume the contrary: let x’ be an 
optimum solution to LPVC(G’) and suppose that w(x’) < vc*(G)-— p. Define 
a feasible solution x” to LPVC(G) by extending x’ with x, = 0 and x, = 1 
for every u € N(v). Clearly, w(x”) = w(x’) + p, thus x” is an optimum 
solution to LPVC(G), contradicting the assumption that the all-5-solution is 
the unique optimum solution. 


64 3 Bounded search trees 


We have shown that the preprocessing rule does not increase the measure 
u(G,k), and that the branching step results in a (4, 5) decrease in u(G, k). 
As a result, we obtain recurrence T (u) < 2T (u — 4) for the number of leaves 
in the search tree. This recurrence solves to 4” = 4*-ve"(@) and we obtain a 


A(k-ve"(@)) . nO()-time algorithm for VERTEX COVER ABOVE LP. 














Let us now consider a different lower bound on the size of a minimum 
vertex cover in a graph, namely the size of a maximum matching. Observe 
that, if graph G contains a matching M, then for k < |M| the instance 
(G,k) is a trivial no-instance of VERTEX Cover. Thus, for a graph G with 
a large maximum matching (e.g., when G has a perfect matching) the FPT 
algorithm for VERTEX COVER of Section -T]is not practical, as in this case 
k has to be quite large. 

This leads to a second above guarantee variant of the VERTEX COVER 
problem, namely the VERTEX COVER ABOVE MATCHING problem. On input, 
we are given an undirected graph G, a maximum matching M and a positive 
integer k. As in VERTEX COVER, the objective is to decide whether G has 
a vertex cover of size at most k; however, now the parameter is k — |M]. 
By the weak duality of linear programs, it follows that vc*(G) > |M]| (see 
Exercise [2.24] and the corresponding hint for a self-contained argument) and 
thus we have that k — vc*(G) < k — |M|. Consequently, any parameterized 
algorithm for VERTEX COVER ABOVE LP is also a parameterized algorithm 
for VERTEX COVER ABOVE MATCHING, and Theorem[3.8lyields the following 
interesting observation. 


Theorem 3.9. VERTEX COVER ABOVE MATCHING can be solved in time 
4k-1M] . p20), 


The VERTEX COVER ABOVE MATCHING problem has been at the center 
of many developments in parameterized algorithms. The reason is that faster 
algorithms for this problem also yield faster algorithms for a host of other 
problems. Just to show its importance, we design algorithms for ODD Cy- 
CLE TRANSVERSAL and ALMOST 2-SAT by making use of the algorithm for 
VERTEX COVER ABOVE MATCHING. 

A subset X C V(G) is called an odd cycle transversal of G if G — X isa 
bipartite graph. In ODD CYCLE TRANSVERSAL, we are given an undirected 
graph G with a positive integer k, and the goal is to determine whether G 
has an odd cycle transversal of size at most k. _ 

For a given graph G, we define a new graph G as follows. Let V; = {u; 


u € V(G)} for i € {1,2}. The vertex set V(G) consists of two copies of V (G), 


ie. V(G) = Vi U V2, and 
E(G) = {uuz : u€ V(G@)}U {uvi : uv € E(G),i € {1, 2}}. 


In other words, G is obtained by taking two disjoint copies of G and by adding 
a perfect matching such that the endpoints of every matching edge are the 
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copies of the same vertex. In particular, M = {ujug : u € V(G)} is a perfect 
matching in G and |M| = n. The graph G has some interesting properties 
which are useful for designing an algorithm for ODD CYCLE TRANSVERSAL. 


Lemma 3.10. Let G be a graph on n vertices. Then G has an odd cycle 
transversal of size at most k if and only if G has a vertex cover of size at 
most n+k. 


Proof. Let X be an odd cycle transversal of size at most k in G, and let (A, B) 
be a bipartition of G — X. For i € {1,2}, let A;, B; and X; be the copies of 
A, B and X in Vj, respectively. Observe that A; U Bə is an independent set 
in G and thus Ao U By U X1 U X29 is a vertex cover of G. However, 





(A| + |Bil) + |X1| + |Xe] < (n-—k) +k +k <nt+k. 


This completes the forward direction of the proof. _ 

In the other direction, let Y be a vertex cover of G of size at most n + k. 
Then I = V(G) \ Y is an independent set of G of size at least n — k. Let 
A = I N Vj, and B=In Vo. Let A and B denote the vertices corresponding 
to A and B in V(G), respectively. Since AU B is independent, we have that 
AN B=9. Therefore, AU B is of size at least n — k. Moreover, G[A U B] is 
bipartite, as G[A] and G[B] are independent sets. Thus, V(G) \ (AU B) is an 
odd cycle transversal of size at most k for G. This completes the proof. 














Using Lemma and Theorem [3.9] we get the following result for ODD 
CYCLE TRANSVERSAL. 


Theorem 3.11. ODD CYCLE TRANSVERSAL can be solved in time 45n ® 


Proof. Given an input (G,k) to ODD CYCLE TRANSVERSAL, we first con- 
struct the graph G. Let M = {uyu2 : u € V(G)} be a perfect matching of 
G. Now we apply Theorem [3.9]to the instance (G, M, n + k) to check in time 
gntk—n . nO) = 4k.nO whether G has a vertex cover of size at most n+ k, 
where n = |V (G)|. If a vertex cover of size at most n + k in G is found, then 
using Lemma [8.I0|we obtain an odd cycle transversal of size at most k in G. 
Otherwise, by Lemma [8.10] G has no odd cycle transversal of size at most 
k. 














In fact, the fastest currently known FPT algorithm for ODD CYCLE 
TRANSVERSAL uses an improved version of the algorithm described 
in Theorem as a subroutine. We will give a 3*n°-time algorithm for 
this problem in the next chapter using a different technique, called iterative 
compression. 

Next, we give an algorithm for ALMOST 2-SAT using the 4*n°)-time 
algorithm for VERTEX COVER ABOVE MATCHING. In the ALMOST 2-SAT 
problem, we are given a 2-CNF formula y and an integer k, and the question 
is whether one can delete at most k clauses from y to make it satisfiable. 
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In a variable-deletion variant, called VARIABLE DELETION ALMOST 2-SAT, 
we instead allow deleting at most k variables; a variable is deleted together 
with all clauses containing it. (One can think of deleting a variable as setting 
it both to true and false at the same time, in this way satisfying all the 
clauses containing it.) Exercises and ask you to prove that these 
two variants are equivalent, and an f(k)nO™-time algorithm for one of them 
implies an f(k)n©-time algorithm for the other one (with the same function 
f). Hence, it suffices to focus only on the variable-deletion variant. 


Theorem 3.12. VARIABLE DELETION ALMOST 2-SAT can be solved in 
Akn oO time. 


Proof. We proceed similarly as in the case of ODD CYCLE TRANSVERSAL: in 
polynomial time, we construct a VERTEX COVER ABOVE MATCHING instance 
(G, M,|M|+k) that is equivalent to the input VARIABLE DELETION ALMOST 
2-SAT instance (y, k). Then the theorem follows from an application of the 
algorithm of Theorem [3.9] to the instance (G, M,|M| + k). 

We construct the graph G as follows. For every variable x, and for every 
literal Z € {x, =x}, we construct a vertex vg, and connect the vertices vy and 
vz by an edge. Let M be a set of edges defined in this step; note that M is 
a matching in G. The intuitive meaning of vertices vz is that picking vz into 
a vertex cover corresponds to valuating the variable of £ so that £ is true. 
For every edge vv. E€ M, we expect to take one endpoint into a vertex 
cover, which corresponds to setting the value of variable x. However, in k 
edges v,U42 E€ M we are allowed to take both endpoints into a vertex cover, 
corresponding to deleting x or, equivalently, setting x both to true and false 
at once. 

To encode clauses, we proceed as follows. For every binary clause 41 V 42, 
we add an edge vg, vg,, So that either ve, or ve, needs to be taken into a vertex 
cover. For every unary clause with literal /, we add a new vertex of degree 1, 
adjacent to ve. 

This concludes the construction of the graph G and the instance (G, M,|M| 
+k). We refer to Fig. [3.1] for an illustration of the construction. We claim that 
(y, k) is a yes-instance of VARIABLE DELETION ALMOST 2-SAT if and only 
if (G, M,|M| + k) is a yes-instance of VERTEX COVER ABOVE MATCHING. 

In one direction, let X be a set of at most k variables of y such that p—X, 
the formula y with all variables of X deleted, has a satisfying assignment 
w. We define a set Y C V(G) as follows. First, for every « € X, we put 
both vz and v~s into Y. Second, for every variable x ¢ X, we put vz into 
Y if y(x) = T and otherwise, if y(x) = L, we put v-v into Y. Clearly 
IY] =|M|+|X| < |M] +k. 

It remains to argue that Y is a vertex cover of G. By construction, Y 
contains at least one endpoint of every edge in M. For the remaining edges, 
consider a clause C. Since X is a feasible solution to (y,k), there exists a 
literal Z in C that is either evaluated to true by wv, or its variable is deleted 
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Vay Vag Urs Uxa 





Unz Uro Vaz Vax 


Fig. 3.1: The graph G for the formula y = (xı V 722) A (a2 V £3) A (“z2 V 
723) A (naz V 24) A (724). The thick edges are the edges of the matching M 


(belongs to X). In both cases, ve € Y and the edge corresponding to the 
clause C is covered by Y. 

In the other direction, let Y be a vertex cover of G, and assume |Y| < 
|M| + k. Without loss of generality, we may assume that Y C V(M);: if, for 
some unary clause C = £, the corresponding degree 1 neighbor of vg belongs 
to Y, we replace it with the vertex ve. Let X be the set of these variables x 
for which both vz and v-z belong to Y. Since Y needs to contain at least one 
endpoint of every edge of M, we have |X| < k. Let us define an assignment 
w on the variables outside X as follows: y(x) = T if vz € Y, and y(x) = L if 
Vax E€ Y. We claim that v is a satisfying assignment of y — X; note that such 
a statement implies that X is a feasible solution to the VARIABLE DELETION 
ALMOST 2-SAT instance (y, k), concluding the proof of the theorem. 

To this end, consider a clause C, and assume no variable of C belongs to 
X. If C = 4 V é9 is a binary clause, then the edge vg, ve, ensures that either 
ve, or ve, belongs to Y, and the corresponding literal is evaluated to true in 
the assignment Ņ%. If C = @ is a unary clause, then the corresponding edge 
incident to vg, together with the assumption that Y C V(M), implies that 
ve € Y and £ is evaluated to true by w. 

Consequently, (p, k) is a yes-instance to VARIABLE DELETION ALMOST 
2-SAT if and only if (G,M,|M| + k) is a yes-instance to VERTEX COVER 
ABOVE MATCHING. This concludes the proof of the theorem. 














3.5 CLOSEST STRING 


In the last part of this chapter we give yet another example of branching 
algorithms, this time for a string problem called CLOSEST STRING. Here, 
we are given k strings x1,...,2,%, each string over an alphabet X and of 
length L, and an integer d. The question is whether there exists a string y of 
length L over X such that dy(y,2;) < d for all i € {1,...,k}. Here, dy (x,y) 
is the Hamming distance between strings x and y, that is, the number of 
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positions where x and y differ. We call any such string y a center string. In 
this section we consider the parameterization by d, the maximum allowed 
distance between the center string and the input strings. 

Let x be a string over alphabet X. We denote the letter on the pth position 
of x as x|p]. Thus x = 2[1]2[2]---a[L] for a string of length L. We say that 
string x and y differ on the pth position if x[p] 4 y[p]. 

Given a set of k strings, each of length L, we can think of these strings 
as a k x L character matrix. By columns of the set of strings, we mean the 
columns of this matrix. That is, the 7-th column is the sequence of letters 
xıl], z2[j], ---, £j]. We call a column bad if it contains at least two different 
symbols from alphabet X, and good otherwise. Clearly, if the j-th column is 
a good one, then we have an obvious greedy choice for the j-th letter of the 
solution: y[j] = v1[j] = valj] = ... = xp[j]. Thus, we obtain the following 
reduction rule. 








Reduction CS.1. Delete all good columns. 


It is straightforward to implement Reduction in linear time. 
We now observe that we cannot have too many bad columns in a yes- 
instance. 


Lemma 3.13. For every yes-instance of CLOSEST STRING, the correspond- 
ing k x L matriz contains at most kd bad columns. 


Proof. Fix a center string y. For every bad column j there exists a string £i(j) 
such that zig) [j] 4 ylj]. Since every string x; differs from y on at most d 
positions, for every i we have i(j) = i for at most d positions j. Consequently, 
there are at most kd bad columns. 














Reduction CS.2. If there are more than kd bad columns, then conclude 
that we are dealing with a no-instance. 


We now give an intuition behind the algorithm. 


Fix a center string y. The idea of our algorithm is to start with one 
of the given strings, say xı, as a “candidate string”, denoted by z. As 
long as there is a string x;, i € {1,...,k}, such that dy(x;,z) >d+1, 
then for at least one of the positions p where z and x; differ, we have 
that y[p] = 2;[p]. Thus we can try recursively d + 1 ways to move 
the candidate z string “closer” to y; moving closer here means that we 
select a position p on which the candidate string z and x; differ and set 
z[p] := x;[p]. As at every step we move closer to y, and at the beginning 
z = xı and dy (z1,y) < d, we obtain a bounded number of possibilities. 


Let us move to formal details. 


Theorem 3.14. CLOSEST STRING can be solved in time O(kL +kd(d+1)?). 


3.5 CLOSEST STRING 69 


Proof. First, we apply Reductions and Unless we have resolved 
the instance already, we are left with k strings 71, £2,..., £k, each of length 
L < kd. The time spent so far is O(kL), that is, linear in the input size. 

Recursively, we solve the following augmented problem: given a candidate 
string z and an integer £ < d, determine whether there exists a center string 
y for the strings 71, £2,...,£ with the additional property dy(y,z) < 4 
Observe that, to solve CLOSEST STRING on our instance, it suffices to solve 
our augmented problem for z = x, and £ = d. 

Given z and £, we first perform the following checks. If z is a center string 
itself, then we return z. Otherwise, if 2 = 0, then we return that there is no 
such center string y. In the remaining case, £ > 0 and there exists a string zi 
with dy(a;, z) > d. Let P bea set of arbitrary d+1 positions on which x; and 
z differ. Observe that for every center string y we have y[p| = x;[p] for at least 
one position p € P. Hence, we branch into |P| = d+ 1 subcases: for every 
p € P, we define zp to be equal z except for position p where z,[p| = 2;[pl, 
and we recursively solve our augmented problem for the pair (zp, — 1). To 
show correctness of this branching, observe that if there exists a center string 
y with dy(z,y) < £, then for a position p € P satisfying x;[p] = y[p] we have 
dyn) < dy(z,y)—1<£-1. 

Concerning the running time of the algorithm, note that we build a search 
tree of depth at most d, and every node of the search tree has at most d + 1 
children. Thus, the size of the search tree does not exceed O((d+1)¢). With 
small technical work which we omit here, every step of the algorithm can be 
implemented in linear time. This completes the proof. 














Exercises 


3.1 (Z). Prove Proposition [3.1] 


3.2 (Z). Show that CLIQUE and INDEPENDENT SET, parameterized by the solution size 
k, are FPT on r-regular graphs for every fixed integer r. Also show that these problems 
are FPT with combined parameters k +r. 


3.3. Show that any graph has at most 2% inclusion-wise minimal vertex covers of size 
at most k. Furthermore, show that given G and k, we can enumerate all inclusion-wise 
minimal vertex covers of G of size at most k in time 2*n©), 


3.4 (Z). In the CLUSTER VERTEX DELETION problem, we are given a graph G and 
an integer k, and the task is to delete at most k vertices from G to obtain a cluster 
graph (a disjoint union of cliques). Obtain a 3*nOC)-time algorithm for CLUSTER VERTEX 
DELETION. 


3.5 (Z). In the CLUSTER EDITING problem, we are given a graph G and an integer k, 
and the objective is to check whether we can turn G into a cluster graph (a disjoint union 
of cliques) by making at most k edge editions, where each edition is adding or deleting one 
edge. Obtain a 3*nO()-time algorithm for CLUSTER EDITING. 
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3.6 (Z). An undirected graph G is called perfect if for every induced subgraph H of G, 
the size of the largest clique in H is the same as the chromatic number of H. In this exercise 
we consider the ODD CYCLE TRANSVERSAL problem, restricted to perfect graphs. 

Recall that Baercia Oe for a kernel with O(k) vertices for this problem. In this 
exercise, we ask for a 3*nCU)-time branching algorithm. 


3.7. Let F be a set of graphs. We say that a graph G is F-free if G does not contain 
any induced subgraph isomorphic to a graph in F; in this context the elements of F are 
sometimes called forbidden induced subgraphs. For a fixed set F, consider a problem where, 
given a graph G and an integer k, we ask to turn G into a F-free graph by: 


(vertex deletion) deleting at most k vertices; 

(edge deletion) deleting at most k edges; 

(completion) adding at most k edges; 

(edition) performing at most k editions, where every edition is adding or deleting one 
edge. 


Prove that, if F is finite, then for each of the four aforementioned problems there exists 
a 20(k)nOQ)-time FPT algorithm. (Note that the constants hidden in the O()-notation 
may depend on the set F.) 


3.8. In the VERTEX CovER/OCT problem, we are given an undirected graph G, an integer 
£, and an odd cycle transversal Z of size at most k, and the objective is to test whether G 
has a vertex cover of size at most £. Show that VERTEX CoveR/OCT admits an algorithm 
with running time 2*n©Q), 


3.9. In this exercise we consider FPT algorithms for FEEDBACK ARC SET IN TOURNA- 
MENTS and FEEDBACK VERTEX SET IN TOURNAMENTS. Recall that a tournament is a 
directed graph, where every pair of vertices is connected by exactly one directed edge (in 
one of the directions). 


1. Let G be a digraph that can be made into a tournament by adding at most k > 2 
directed edges. Show that if G has a cycle then it has a directed cycle of length at 
most 3Vk. 

2. Show that FEEDBACK ARC SET IN TOURNAMENTS admits a branching algorithm with 
running time (3Vk)¥n@@Q), 

3. Show that FEEDBACK VERTEX SET IN TOURNAMENTS admits a branching algorithm 
with running time 3*n@@), 

4. Observe that, in the FEEDBACK ARC SET IN TOURNAMENTS problem, we can equiv- 
alently think of reversing an edge instead of deleting it. Use this observation to show 


a branching algorithm for FEEDBACK Arc SET IN TOURNAMENTS with running time 
3knOQ), 


3.10 (Æ). A bipartite tournament is an orientation of a complete bipartite graph, meaning 
its vertex set is a union of two disjoint sets Vi and V2 and there is exactly one arc between 
every pair of vertices u and v such that u € Vi and v € V2. 


1. Show that a bipartite tournament has a directed cycle if and only if it has a directed 
cycle on four vertices. 

2. Show that DIRECTED FEEDBACK VERTEX SET and DIRECTED FEEDBACK ARC SET 
admit algorithms with running time 4*n°@) on bipartite tournaments. 


3.11 (È). A graph is chordal if it does not contain a cycle on at least four vertices as 
an induced subgraph. A triangulation of a graph G is a set of edges whose addition turns 
G into a chordal graph. In the CHoRDAL COMPLETION problem, given a graph G and an 
integer k, we ask whether G admits a triangulation of size at most k. 
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1. Show that CHoRDAL COMPLETION admits an algorithm with running time kO(*)nO@), 

2. Show that there is a one-to-one correspondence between inclusion-wise minimal tri- 
angulations of a cycle with Z vertices and binary trees with Z — 2 internal nodes. 
Using this correspondence show that a cycle on £ vertices has at most 4-2 minimal 
triangulations. 

3. Use the previous point to obtain an algorithm with running time 20()n©°@® for 
CHORDAL COMPLETION. 


You may use the fact that, given a graph G, one can in polynomial time check if G is 
a chordal graph and, if this is not the case, find in G an induced cycle on at least four 
vertices. 


3.12. In the Min-OneEs-r-SAT problem, we are given an r-CNF formula ¢ and an integer 
k, and the objective is to decide whether there exists a satisfying assignment for ¢ with 
at most k variables set to true. Show that Min-Ones-r-SAT admits an algorithm with 
running time f(r,k)nO“ for some computable function f. 


3.13. In the Min-2-SAT problem, we are given a 2-CNF formula ¢ and an integer k, and 
the objective is to decide whether there exists an assignment for ¢ that satisfies at most k 
clauses. Show that Mrn-2-SAT can be solved in time 2*n@), 


3.14. In the Minimum MaximaL MATCHING problem, we are given a graph G and an 
integer k, and the task is to check if G admits an (inclusion-wise) maximal matching with 
at most k edges. 


1. Show that if G has a maximal matching of size at most k, then V(M) is a vertex cover 
of size at most 2k. 

2. Let M be a maximal matching in G and let X C V(M) be a minimal vertex cover 
in G. Furthermore, let Mı be a maximum matching of G[X] and M2 be a maximum 
matching of G[V(G) \ V(M1)]. Show that Mı U M2 is a maximal matching in G of 
size at most |M]. 

3. Obtain a 4*nO()-time algorithm for Minimum MAXIMAL MATCHING. 


3.15 (8). In the Max LEAF SPANNING TREE problem, we are given a connected graph 
G and an integer k, and the objective is to test whether there exists a spanning tree of 
G with at least k leaves. Obtain an algorithm with running time 4*n°) for Max LEAF 
SPANNING TREE. 


3.16 (%). An out-tree is an oriented tree with only one vertex of indegree zero called 
the root. In the DIRECTED Max LEAF problem, we are given an directed graph G and an 
integer k, and the objective is to test whether there exists an out-tree in G with at least k 
leaves (vertices of outdegree zero). Show that DiREcTED Max Lear admits an algorithm 
with running time 4*n©Q), 


3.17. Describe an algorithm running in time O(1.381”") which finds the number of inde- 
pendent sets (or, equivalently, vertex covers) in a given n-vertex graph. 


3.18. Show that if a graph on n vertices has minimum degree at least 3, then it contains a 
cycle of length at most 2[logn]. Use this to design a (logn)°(*)n©@)-time algorithm for 
FEEDBACK VERTEX SET on undirected graphs. Is this an FPT algorithm for FEEDBACK 
VERTEX SET? 


3.19. Prove that the statement of Lemma [3.6] is true for any optimum solution x, not 
necessarily a half-integral one. (For a not half-integral solution x, we define V* = {v € 
V(G) : ay > $} and VX = {v € V(G) : ay < $},) 
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3.20. A graph G is a split graph if V(G) can be partitioned into sets C and J, such that 
C is a clique and J is an independent set in G. In the SPLIT VERTEX DELETION problem, 
given a graph G and an integer k, the task is to check if one can delete at most k vertices 
from G to obtain a split graph. 

Give a polynomial-time algorithm that, given a SPLIT VERTEX DELETION instance 
(G,k), produces an equivalent VERTEX CoveR ABOvE MatTcuinc instance (G’, M,|M|+ 
k). Use this to design a 4knO)-time algorithm for SPLIT VERTEX DELETION. 


3.21. Show how to, given an ALMOST 2-SAT instance (y, k), compute in polynomial time 
an equivalent instance (y’,k) of VARIABLE DELETION ALMOST 2-SAT. 


3.22. Show the reverse of the previous exercise. That is, show how to, given an instance 
(p, k) of VARIABLE DELETION ALMOST 2-SAT, compute in polynomial time an equivalent 
instance (y’,k) of ALMost 2-SAT. 


3.23 (2). For an independent set J in a graph G, the surplus of I is defined as |N(I)|—|J|. 


1. Show that a graph reduced with respect to Reduction[VC.5](i-e., this reduction cannot 
be further applied) does not admit independent sets with nonpositive surplus. 

2. Show how to detect independent sets of surplus 1 in such a graph using the LP 
relaxation of VERTEX COVER. 

3. Design a reduction rule for VERTEX Cover that handles a (given) independent set of 
surplus 1. How does the measure u(G, k) = k — vc*(G) behave in your reduction rule? 

4. Show that, if a graph does not admit an independent set of surplus at most 1, then 
the branching of Theorem has branching vector (4, 1), and the algorithm runs in 
time 2.6181*-vo" (@) nO), 


3.24 (@). Consider the CLOSEST STRING problem, where in the input there are two strings 
x; and xj with dy(2;,7;) = 2d. Show that in this case the CLOSEST STRING problem can 
be solved in time O(kL + kd4*). 


3.25 (4). Consider a generalization of CLosrst STRING where, apart from the strings 
%1,22,...,%,, each over alphabet X and of length L, we are given integers d1, d2,..., dp, 
and we ask for a center string y of length L such that dy (y, xi) < di for every 1 < i < k. 
Consider the following recursive algorithm for this generalization. 


1. First, try if zı is a center string. If this is the case, then return xı and finish. 

2. Otherwise, if dı = 0, then return that there is no solution. 

3. In the remaining case, take any j > 2 such that d(x1, xj) > dj. Let P be the set of 
positions on which x; and æ; differ. If |P| > dı + dj, return that there is no solution. 

4. Guess the values y[p] for every p € P, where y is the center string we are looking for, 
in such a way that y differs from x; on at most d; positions from P for every 1 < i < k. 

5. For every guess (y[p])pep, define d; = di —|{pe€ P : y[p] A x: [p]}| and let z; be the 
string x; with letters on positions of P deleted. 

6. Observe that x = x}. Discard the string x}, and set d) := min(d} , d4). 

7. Recurse on the strings x; and integers di, for every guess (y[p])pep- 


Denote d = max) <;<x dj. 


1. Show that in every recursive step it holds that di < d;/2. 
2. Use this fact to show that this algorithms solves the generalization of CLOSEST STRING 
in time 20(9| D/4(KL)OM, 
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Hints 


Consider the following branching algorithm: given an instance (G, k), pick an arbitrary 
edge uv € E(G) and branch on two instances (G — u, k — 1) and (G — v, k — 1). Stop when 
G becomes edgeless or k < 0. Show that in this manner you generate at most 2” leaves 
of the search tree, and every minimal vertex cover of G of size at most k appears in some 
leaf. 


Observe that a graph is a cluster graph if and only if it does not contain a path on 
three vertices (P3) as an induced subgraph. As long as there exists a P3 as an induced 
subgraph, branch by choosing one of its vertices to delete. 


[3-5] Proceed as in Exercise [3.4] and observe that you can break a P in three different 
ways. 


Observe that the class of perfect graphs is closed under taking induced subgraphs, 
and a perfect graph is bipartite if and only if it does not contain a triangle. 


Observe that, if F is finite and fixed, then 


1. we can in polynomial time verify if a graph G is F-free and, if not, find an induced 
subgraph of G that is isomorphic to a member of F; 

2. there are only O(1) ways to break such a forbidden induced subgraph in all four 
considered problem variants. 


For every v € Z, branch on whether to include v or N (v) into the desired vertex cover. 
Observe that the remaining graph is bipartite. 


For the first point, observe that a shortest cycle in a directed graph cannot have any 
chord. Moreover, if £ is the length of some chordless cycle in D, then there are at least 
£(£ — 3)/2 pairs of nonadjacent vertices in D. To solve the second point, show how to find 
such a chordless cycle, and branch choosing which edge to delete from it. 

For the third and fourth point, observe that a tournament is acyclic if and only if it 
does not contain a directed triangle. 


Use again the observation that in the DIRECTED FEEDBACK Arc SET problem you 
can alternatively reverse edges instead of deleting them. Thus, the graph remains a bipartite 
tournament in the course of the algorithm, and every directed cycle on four vertices can 
be broken in exactly four ways in both considered problems. 


First, design an algorithm that either verifies that G is a chordal graph, or finds an 
induced cycle on at least four vertices in G. Second, observe that such a cycle on more 
than k+3 vertices is a certificate that (G, k) is a no-instance. For the first point, it suffices 
to branch choosing one edge to add to the cycle found. For the last point, the branching 
should consider all minimal triangulations of the cycle. 


Start with an assignment that sets all variables to false. As long as there exists an 
unsatisfied clause C, branch on C, in each subcases choosing one positive literal of C to 
be satisfied. In this way, you obtain an r*nO()-time algorithm. 


First, observe that if some variable appears only positively (or negatively), then we 
can assign its value in such a way that it does not satisfy any clause. Second, note that 
if the aforementioned preprocessing rule is not applicable, then branching on any variable 
makes at least one clause satisfied, decreasing the budget k by at least 1. 


The first point is straightforward. The fact that Mı U Mə is a maximal matching 
follows directly from the fact that M2 is a maximum matching in G — V(M1). For the 
cardinality bound, let Mx C M be the set of edges of M that have both endpoints in 
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X. Observe that |M| = |Mx|+|M \ Mx| = |Mx|+ (|X| — 2|Mx |) = |X| — |Mx |, while 
|My| > |Mx| and |Mo| < |X|- |V (M1)| = |X|—2|M1|. For the last point, use Exercise[3.3] 
to enumerate all minimal vertex covers of G of size at most 2k. 


Start with the following observation: if G is connected and has a subtree T, rooted 
at some vertex r, with at least k leaves, then G has also a spanning tree with at least k 
leaves. In our algorithm, we start by guessing a non-leaf vertex r of the tree in question, 
and consider all further trees as rooted in r. 

At every node of the search tree we have a subtree T of G rooted at node r. Let L(T) 
denote the set of leaves of T. Our objective in this branch is to find a subtree with k leaves 
(if it exists) that is a supergraph of T. 

First, observe that if there exists a leaf v € L(T) with at least two neighbors in V(G) \ 
V(T), then we can branch on v. In one subcase, we consider v being an internal vertex of 
the tree in question, adding all edges vu for u € N(v) \ V(Z) to the tree T. In the second 
subcase, we consider v being a leaf vertex of the tree in question, deleting all edges vu for 
u € N(v) \ V(T) from the graph G. Show that this branching is correct. Observe that in 
every branching step we either fix one vertex to be a leaf, or increase the number of leaves 
of T. Hence, this branching leads to at most 2?" leaves of the search tree. 

We are left with the case when all vertices v € L(T) have at most one neighbor outside 
V(T). Prove that in this case a similar branching is still valid: for a vertex v € L(T), either 
proclaim v a leaf, or fix as internal vertices both v and all its descendants, up to the closest 
vertex of degree at least three, inclusive. 

In the leaves of the search tree we have trees T with |L(T)| > k (where we report that 
(G, k) is a yes-instance), or trees T where all the leaves have been fixed as leaves, but still 
|L(T)| < k (where we report no solution in this branch). 


Adapt the solution of Exercise to the directed case. 


Note that there is a bijection between vertex covers and maximal independent sets, 
since the complement of a vertex cover is an independent set. Show a counting counterpart 
of Proposition [3-1] and design a branching algorithm where the number of leaves in the 
search tree is bounded by the solution of the recurrence T(n) = T(n—1) +T(n—4). Solve 
the recurrence by applying the techniques described in Section [3.2] (use Table[3.1}. 


To prove the desired upper bound on the length of the shortest cycle, consider a 
breadth-first search tree of the input graph. 

To show that the obtained algorithm is an FPT one, consider two possibilities. If n < k*, 
then logn < klogk and (logn)* < 20ls*), Otherwise, k = O(logn/loglogn) and 
(logn)* = 200087) = n°), Alternatively, you may use the Cauchy-Schwarz inequality 
to observe that 


2 2 
(log n)* = gk loglogn < gh Higa). = gk? /2 glog log n)? /2 = gk? /2 x ne). 


Apply Exercise to x, obtaining a half-integral solution y. Observe that Re- 
duction behaves the same way when fed with the solution x and with the solution 


y. 
In short, modify the reduction of Lemma [3.10 


For a given a graph G, we define a new graph G as follows. Let V; = {u; : u € V(G)}, 
i € {1,2}. The vertex set V (G) consists of two copies of V (G), i.e., V(G) = Vi U V2 and 


E(G) = {u1u2 : u € V(G)}U{u1v1 : uv € E(G)}U {uz2v2 : w ¢ E(G)}. 
In other words, G is obtained by taking a disjoint copy of G and a complement of G, and 


adding a perfect matching such that the endpoints of every matching edge are copies of 
the same vertex. 
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The main idea is to make, for every variable x, a different copy for every appearance 
of x in the formula y. To create the formula y’, we first copy all the clauses of p, replacing 
every variable x with one of its copies so that no copy is used twice, and then we add 
clauses enforcing equality between every pair of copies of the same variable. Show that a 
deletion of a clause C in the original instance corresponds to the deletion of one of the 
copies of a variable appearing in C in the new instance. 


[3.22] The main idea is to replace every variable x with its two copies, x! and z+: a! has 
the meaning “Is x true?”, whereas z+ has the meaning “Is « false?”. We connect them by a 
clause sa! Vat to enforce that only one value of x is chosen. Moreover, we replace every 
literal x with «', and every literal =z with «+. Argue that in the new instance (y’, k) 
of ALMOST 2-SAT there exists a minimum solution that deletes only clauses of the form 
aa! V-a+. Observe that deleting such a clause corresponds to setting x to both true and 
false at once in the input formula y, satisfying all the clauses containing zx. 





[3.23] Let J be an independent set in G of surplus a. Define the following vector x = 
(tv)vev(G): £v = 0 for v € I, £y = 1 for v € N(I) and ay = 4 for v ¢ N[I]. Observe that 


x is a feasible solution to LPVC(G) of cost nac + 5. This proves the first point. 

Moreover, it also proves the following: if we pick any vertex v and solve LPVC(G) with 
an additional constraint x, = 0 (in other words, solve LPVC(G — N[v]) and extend it with 
Ly = 0 and £u = 1 for every u E€ N(v)), obtaining a half-integral solution x, then the set 
Vý is an independent set of minimum surplus among all independent sets containing v. In 
particular, this gives us a way to detect independent sets with surplus 1, as well as shows 
that, if the minimum surplus of an independent set containing v is at least 2, then in a 
branch where we exclude v from a vertex cover, the measure u(G, k) drops by at least 1. 

It remains to argue about the third point, tackling with a reduction rule. To this end, 
prove the following: if Reduction [VC.5]is not applicable, and J is of surplus 1, then there 
exists a minimum vertex cover of G that contains the whole J or the whole N(J). Recall 
|I| + 1 = |N(J)|. If GLN(J)] contains an edge, then we can greedily take N(JI) into the 
vertex cover. Otherwise, we can replace N{J] with a single vertex incident to N(N[I]) and 
decrease k by |I|. A direct check shows that the measure u(G, k) does not increase in these 
steps. 


Observe that, in this case, for every center string y, y[p] equals x;[p] whenever 
xilo] = x;[p], whereas for every position p with x;[p] 4 xj;[p] we have y[p] = 2;[p] or 
ylp] = x;[p]. Thus, there are two options for each of 2d positions where x; and zj differ. 


For the first point, consider two cases depending on for how many positions p € P 
the character y[p] has been guessed so that y[p] = x;[p]. If there are more than d;/2 such 
positions, then xı differs on all of them from y, and we have dj, < d;/2. Show that in the 
second case, when there are at most dı /2 such positions, it holds that di < d,/2. Use here 
the fact that |P| > dj. 

For the second point, think of choosing (y[p])pep as a three stage process. First, we 
choose an integer 0 < £ < dj: y will differ from xı on exactly £ positions of P. Second, 
we choose the set Q of these £ positions; there are (17) < (%7) possibilities. Third, we 
choose y[p] for every p € Q; there are (|X| — 1)£ possibilities. 

Using dı as a measure for the complexity of an instance, and using the first point, we 
obtain the following recurrence for time complexity. 


dı 
Ta) < XS (OF) U21- D- Tmin(a: — 6, a1/2) 


£=0 
Ldi /2| didi dı PEA 
= A y Aera S A y a: 


e=0 é=|d,/2|+1 
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It remains to use the aforementioned recurrence to prove by induction that 


Tia) < (77 “284 15) — 1). 


d+ dı 

dı 
(Constant 6 in the exponent is not optimal; it has been chosen so that the proof goes 
smoothly.) 
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Chapter 4 
Iterative compression 


In this chapter we introduce iterative compression, a 
simple yet very useful technique for designing fixed- 
parameter tractable algorithms. Using this technique 
we obtain FPT algorithms for FEEDBACK VERTEX 
SET IN TOURNAMENTS, FEEDBACK VERTEX SET and 
ODD CYCLE TRANSVERSAL. 





In 2004, Reed, Smith and Vetta presented the first fixed-parameter 
tractable algorithm for ODD CYCLE TRANSVERSAL, running in time 3*nO). 
This result is important not only because of the significance of the prob- 
lem, but also because the proposed approach turned out to be a novel and 
generic technique, applicable in many other situations. Based on this new 
technique, called nowadays iterative compression, a number of FPT algo- 
rithms for several important problems have been obtained. Besides ODD Cy- 
CLE TRANSVERSAL, examples include DIRECTED FEEDBACK VERTEX SET 
and ALMOST 2-SAT. 

Typically, iterative compression algorithms are designed for parameterized 
minimization problems, where the goal is to find a small set of vertices or 
edges of the graph, whose removal makes the graph admit some global prop- 
erty. The upper bound on the size of this set is the parameter k. The main 
idea is to employ a so-called compression routine. A compression routine is 
an algorithm that, given a problem instance and a corresponding solution, 
either calculates a smaller solution or proves that the given solution is of the 
minimum size. Using a compression routine, one finds an optimal solution 
to the problem by iteratively building up the structure of the instance and 
compressing intermediate solutions. 

The main point of the iterative compression technique is that if the com- 
pression routine runs in FPT time, then so does the whole algorithm. The 
strength of iterative compression is that it allows us to see the problem from 
a different viewpoint: The compression routine has not only the problem 
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instance as input, but also a solution, which carries valuable structural in- 
formation about the instance. Therefore, constructing a compression routine 
may be simpler than designing a direct FPT algorithm for the original prob- 
lem. 

While embedding the compression routine into the iteration framework is 
usually straightforward, finding the compression routine itself is not. There- 
fore, the art of iterative compression typically lies in the design of the com- 
pression routine. In this chapter we design algorithms for FEEDBACK VER- 
TEX SET IN TOURNAMENTS, FEEDBACK VERTEX SET and ODD CYCLE 
TRANSVERSAL using the method of iterative compression. An important case 
of ALMOST 2-SAT is covered in the exercises. This technique will be also ap- 
plied in Chapter [7]to solve PLANAR VERTEX DELETION, and in Chapter [8] 
for DIRECTED FEEDBACK VERTEX SET. 


4.1 Illustration of the basic technique 


The technique described in this section is based on the following strategy. 


Solution compression: First, apply some simple trick so that you can 
assume that a slightly too large solution is available. Then exploit the 
structure it imposes on the input graph to construct an optimal solution. 


As a simple example of this approach, let us try to apply it to our favourite 
example problem VERTEX Cover. The algorithm we are going to obtain now 
is much worse than the one obtained in the previous chapter, but it serves well 
for the illustration. Assume we are given a VERTEX COVER instance (G, k). 
We use the well-known 2-approximation algorithm to obtain an approximate 
vertex cover Z. If |Z| > 2k, then we can clearly conclude that (G,k) is a 
no-instance, so assume otherwise. We are now going to exploit the structure 
that Z imposes on the graph G: Z is small, so we can afford some branching 
on Z, while at the same time G — Z is edgeless. 

The branching step is as follows: we branch in all possible ways an optimal 
solution X can intersect Z. Let Xz C Z be one such guess. We are searching 
now for a vertex cover X of size at most k, such that X N Z = Xz. Let 
W = Z\ Xz. If there is an edge in G[W], then we can clearly conclude 
that our guess of Xz was wrong. Otherwise, note that any vertex cover X 
satisfying X N Z = Xz needs to include Xz U Ne(W). Furthermore, since 
G — Z is an independent set, Xz U NG(W) is actually a vertex cover of G. 
Consequently, if for some choice of Xz C Z we have |Xz U Ne(W)| < k, 
then we return a solution X := Xz U Ne(W), and otherwise we conclude 
that (G,k) is a no-instance. Thus, we obtain an algorithm solving VERTEX 
Cover in time 2!2/nOQW < 4knO, 
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Let us go back to the point when we have guessed the set Xz and defined 
W = Z\ Xz. In the previous paragraph we have in fact argued that the 
following DISJOINT VERTEX COVER problem is polynomial-time solvable: 
Does G — Xz contain a vertex cover of size at most k — |Xz| that is disjoint 
from W? Note here that W is a vertex cover of G — Xz, giving us a lot of 
insight into the structure of G — Xz. 

In our example, the final dependency of the running time on k is much 
worse than the one obtained by branching algorithms in the previous chapter. 
One of the reasons is that we started with a large set Z, of size at most 2k. 
Fortunately, there is an easy and very versatile way to obtain a set Z of size 
k +1. This is exactly the main trick of iterative compression. 

As the name suggests, in iterative compression we apply the compression 
step iteratively. To exemplify this idea on VERTEX COVER, let us take an 
arbitrary ordering (v1,v2,...,Un) of G. For i € {1,...,k}, we denote by G; 
the subgraph of G induced by the first i vertices. For i = k, we can take the 
vertex set of G; as a vertex cover X in G; of size k. We proceed iteratively. 
Suppose that for some i > k, we have constructed a vertex cover X; of Gi 
of size at most k. Then in graph Gj41, set Zj41 := X; U {uj41} is a vertex 
cover of size at most k + 1. If actually |Z;+ı| < k then we are done: we can 
simply put X;41 = Zi+ı and proceed to the next iteration. Otherwise we have 
|Z;41| = k + 1, and we need to compress the too large solution. By applying 
the branching algorithm described above, i.e., solving 2!7*+1! = 2*+! instances 
of DISJOINT VERTEX COVER, in time 2*+1n°™, we can either find a vertex 
cover Xi+ı of G; of size at most k, or conclude that no such cover exists. If 
G; does not admit a vertex of size at most k, then of course neither does G, 
and we may terminate the whole iteration and provide a negative answer to 
the problem. If Xi+ı has been found, however, then we may proceed further 
to the graph Gi+2 and so on. To conclude, observe that Gn = G, so at the 
last iteration we obtain a solution for the input VERTEX COVER instance in 
time nl, 

Combined with other ideas, this simple strategy becomes a powerful tech- 
nique which can be used to solve different parameterized problems. Let us 
sketch how this method can be applied to a graph problem. The central idea 
here is to design an FPT algorithm which for a given (k + 1)-sized solution 
for a problem either compresses it to a solution of size at most k or proves 
that there is no solution of size at most k. This is known as the compression 
step of the algorithm. The method adopted usually is to begin with a sub- 
graph that trivially admits a k-sized solution and then expand it iteratively. 
In any iteration, we try to find a compressed k-sized solution for the instance 
corresponding to the current subgraph. If we find such a solution, then by 
adding a vertex or an edge we obtain a solution to the next instance, but 
this solution can be too large by 1. To this solution we apply the compres- 
sion step. We stop when we either obtain a solution of size at most k for the 
entire graph, or if some intermediate instance turns out to be incompressible. 
In order to stop in the case when some intermediate instance turns out to 
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be incompressible, the problem must have the property that the optimum 
solution size in any intermediate instance is at most the optimum solution 
size in the whole graph. 


4.1.1 A few generic steps 


We start with explaining the generic steps of iterative compression that are 
common to most of the problems. We focus here on vertex subset problems; 
the steps for an edge subset problem are analogous. Suppose that we want 
to solve (*)-COMPRESSION, where the wildcard (*) can be replaced by the 
name of the problem we are trying to solve. In (*)-COMPRESSION, as input 
we are given an instance of the problem (*) with a solution of size k + 1 
and a positive integer k. The objective is to either find a solution of size at 
most k or conclude that no solution of size at most k exists. For example, 
for (x) =FEEDBACK VERTEX SET, we are given a graph G and a feedback 
vertex set X of size k + 1. The task is to decide whether G has a feedback 
vertex set of size at most k. 

The first observation that holds for all the problems in this section is the 
following: 


c 


If there exists an algorithm solving (*)- COMPRESSION in time f(k)-n°, 
then there exists an algorithm solving problem (x) in time O(f(k):n°*"). 


We already explained how to prove such an observation for VERTEX 
CovER. We will repeat it once again for FEEDBACK VERTEX SET IN TOUR- 
NAMENTS and skip the proofs of this observation for FEEDBACK VERTEX 
SET and ODD CYCLE TRANSVERSAL. 

Now we need to compress a solution Z of size k + 1. In all our examples 
we follow the same strategy as we did for VERTEX COVER. That is, for every 
i € {0,...,k} and every subset Xz of Z of size i, we solve the following 
DISsJOINT-(*) problem: Either find a solution X to (x) in G— Xz such that 
|X| < k— i, where X and W := Z \ Xz are disjoint, or conclude that this is 
impossible. Note that in the disjoint variant we have that |W| = k— i+ 1, so 
again the size of the solution W is one larger than the allowed budget; the 
difference now is that taking vertices from W is explicitly forbidden. We use 
the following observation, which follows from simple branching. 


If there exists an algorithm solving DISJOINT-(*) in time g(k) - nO, 


then there exists an algorithm solving (*)-COMPRESSION in time 
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k 


5 ( *) alk — n°), 


i=0 


In particular, if g(k) = a*, then (+*)-COMPRESSION can be solved in 
time (1 + a)*nO, 


Thus, the crucial part of the algorithms based on iterative compression lies 
in solving the disjoint version of the corresponding problem. We will provide 
the detailed proof of the above observation for FEEDBACK VERTEX SET IN 
TOURNAMENTS. We will not repeat these arguments for FEEDBACK VERTEX 
SET and ODD CYCLE TRANSVERSAL, and explain only algorithms solving 
DIsJOINT-(*) for these problems. 

Finally, let us point out that when designing algorithms for the compres- 
sion and disjoint versions of a problem, one cannot focus only on the decision 
version, where the task is just to determine whether a solution exists. This 
is because constructing an actual solution is needed to perform the next step 
of the iteration. This issue is almost never a real problem: either the algo- 
rithm actually finds the solution on the way, or one can use the standard 
method of self-reducibility to query a decision algorithm multiple times in 
order to reconstruct the solution. However, the reader should be aware of the 
caveat, especially when trying to estimate the precise polynomial factor of 
the running time of the algorithm. 


4.2 FEEDBACK VERTEX SET IN TOURNAMENTS 


In this section we design an FPT algorithm for FEEDBACK VERTEX SET IN 
TOURNAMENTS (F VST) using the methodology of iterative compression. In 
this problem, the input consists of a tournament T and a positive integer k, 
and the objective is to decide whether there exists a vertex set X C V(T) 
of size at most k such that T — X is a directed acyclic graph (equivalently, 
a transitive tournament). We call the solution X a directed feedback vertex 
set. Let us note that FEEDBACK VERTEX SET IN TOURNAMENTS is a special 
case of DIRECTED FEEDBACK VERTEX SET, where the input directed graph 
is restricted to being a tournament. 

In what follows, using the example of FEEDBACK VERTEX SET IN TOUR- 
NAMENTS we describe the steps that are common to most of the applications 
of iterative compression. 

We first define the compression version of the problem, called FEEDBACK 
VERTEX SET IN TOURNAMENTS COMPRESSION. In this problem, the input 
consists of a tournament T, a directed feedback vertex set Z of T of size 
k +1, and a positive integer k, and the objective is either to find a directed 
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feedback vertex set of T of size at most k, or to conclude that no such set 
exists. 

Suppose we can solve FEEDBACK VERTEX SET IN TOURNAMENTS COM- 
PRESSION in time f(k)-n°. Given this, we show how to solve the original 
problem in f(k)-n°™ time. We take an arbitrary ordering (v1, v2,...,Un) of 
V(T) and for every i € {1,...,n} we define V; = {v1,...,u;} and T; = T[Vj]. 
Notice that 


e V} is a directed feedback vertex set of size k of T),. 

e If X is a directed feedback vertex set of T;, then X U {vi+1} is a directed 
feedback vertex set of Tj+1. 

e If 7; does not admit a directed feedback vertex set of size at most k, then 
neither does T. 


These three facts together with an f(k) - n°-time algorithm for FEEDBACK 
VERTEX SET IN TOURNAMENTS COMPRESSION imply an f(k)-n°+!-time al- 
gorithm for FEEDBACK VERTEX SET IN TOURNAMENTS as follows. In tour- 
nament Tk set Vk is a directed feedback vertex set of size k. Suppose that for 
i > k we have constructed a directed feedback vertex set X; of T; of size at 
most k. Then in Tj41, set Zj41 := X; U {v;41} is a directed feedback vertex 
set of size at most k + 1. If actually |Z;41| < k, then we may proceed to the 
next iteration with X;,1 = Z;41. Otherwise we have |Z;41| = k +1. Then in 
time f(k) - n° we can either construct a directed feedback vertex set Xj+1 in 
Ti41 of size k, or deduce that (T, k) is a no-instance of FEEDBACK VERTEX 
SET IN TOURNAMENTS. By iterating at most n times, we obtain the following 
lemma. 


Lemma 4.1. The existence of an algorithm solving FEEDBACK VERTEX SET 
IN TOURNAMENTS COMPRESSION in time f(k)-n° implies that FEEDBACK 
VERTEX SET IN TOURNAMENTS can be solved in time O(f(k)- n°"). 


In all applications of iterative compression one proves a lemma similar to 
Lemma [4.1] Hence, the bulk of the work goes into solving the compression 
version of the problem. We now discuss how to solve the compression problem 
by reducing it to a bounded number of instances of the following disjoint ver- 
sion of the problem: DISJOINT FEEDBACK VERTEX SET IN TOURNAMENTS. 
In this problem, the input consists of a tournament T together with a di- 
rected feedback vertex set W and the objective is either to find a directed 
feedback vertex set X C V(T) \ W of size at most k, or to conclude that no 
such set exists. 

Let Z be a directed feedback vertex set of size k+1 in a tournament T. To 
decide whether T contains a directed feedback vertex set X of size k (i.e., to 
solve a FEEDBACK VERTEX SET IN TOURNAMENTS COMPRESSION instance 
(G, Z,k)), we do the following. We guess the intersection of X with Z, that 
is, we guess the set Xz := X N Z, delete Xz from T and reduce parameter 
k by |Xz|. For each guess of Xz, we set W := Z \ Xz and solve DISJOINT 
FEEDBACK VERTEX SET IN TOURNAMENTS on the instance (T — Xz,W,k— 
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|X z|). If for some guess Xz we find a directed feedback vertex set X’ of T—Xz 
of size at most k—|X z| that is disjoint from W, then we output X := X’UXz. 
Otherwise, we conclude that the given instance of the compression problem 
is a no-instance. The number of all guesses is bounded by ur er), So to 
obtain an FPT algorithm for FEEDBACK VERTEX SET IN TOURNAMENTS 
COMPRESSION, it is sufficient to solve DISJOINT FEEDBACK VERTEX SET IN 
TOURNAMENTS in FPT time. This leads to the following lemma. 


Lemma 4.2. If there exists an algorithm solving DISJOINT FEEDBACK VER- 
TEX SET IN TOURNAMENTS in time g(k)-nO”), then there exists an algorithm 
solving FEEDBACK VERTEX SET IN TOURNAMENTS COMPRESSION in time 
ee (**) g(k-1) -nO). In particular, if g(k) = a* for some fixed constant 
a, then the algorithm runs in time (a +1)* nO, 


By Lemmas [4.1]and we have that 


Solving FEEDBACK VERTEX SET IN TOURNAMENTS boils down to solv- 
ing the disjoint variant of the problem. 


There is nothing very particular about FEEDBACK VERTEX SET IN TOUR- 
NAMENTS in Lemma[4.2] In many graph modification problems, for which the 
corresponding disjoint version can be solved in time g(k)-n°“), one can ob- 
tain an algorithm for the original problem by proving a lemma analogous to 
Lemma [4.2] We need only the following two properties: (i) a way to deduce 
that if an intermediate instance is a no-instance, then the input instance is 
a no-instance as well; and (ii) a way to enhance a computed solution X; of 
an intermediate instance G; with a new vertex or edge to obtain a slightly 
larger solution Z;, of the next intermediate instance G41. 


4.2.1 Solving DISJOINT FEEDBACK VERTEX SET IN 
TOURNAMENTS in polynomial time 


Our next step is to show that DISJOINT FEEDBACK VERTEX SET IN TOUR- 
NAMENTS can be solved in polynomial time. As we already discussed, to- 
gether with Lemmas [4.1] and this shows that FEEDBACK VERTEX SET 
IN TOURNAMENTS can be solved in time 25n ®., 

For our proof we need the following simple facts about tournaments, which 
are left as an exercise. 


Lemma 4.3. Let T be a tournament. Then 


1. T has a directed cycle if and only if T has a directed triangle; 
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2. If T is acyclic then it has unique topological ordering. That is, there exists 
a unique ordering < of the vertices of T such that for every directed edge 
(u,v), we have u < v (that is, u appears before v in the ordering <). 


We now describe the algorithm for the disjoint version of the problem. 

Let (T,W,k) be an instance of DISJOINT FEEDBACK VERTEX SET IN 
TOURNAMENTS and let A = V(T) \ W. Recall that we have that |W| = 
k + 1. Clearly, we can assume that both T[W] and TA] induce transitive 
tournaments, since otherwise (T, W, k) is a no-instance. By Lemma [4.3] in 
order to solve DISJOINT FEEDBACK VERTEX SET IN TOURNAMENTS it is 
sufficient to find a set of at most k vertices from A intersecting all the directed 
triangles in T. This observation gives us the following simple reduction. 


Reduction FVST.1. If T contains a directed triangle x,y,z with exactly 
one vertex from A, say z, then delete z and reduce the parameter by 1. The 
new instance is (T — {z}, W, k — 1). 


Reduction[FVST.1|simply says that all directed triangles in T with exactly 
one vertex in A can be eliminated by picking the corresponding vertex in A. 
Safeness of Reduction [FVST.1] follows from the fact that we are not allowed 
to select any vertex from W. 

Given an instance (T, W, k), we first apply Reduction|FVST. lJexhaustively. 
So from now onwards assume that Reduction [FVST.1]is no longer applicable. 
Since tournaments T[W] and T|[A] are acyclic, by Lemma we know that 
they both have unique topological orderings. Let the topological orderings of 
T[W] and T[A] be denoted by o = (wi,..., wq) and p, respectively. Suppose 
X is a desired solution; then T[W U (A \ X)] is a transitive tournament with 
the unique ordering such that when we restrict this ordering to W, we obtain 
g, and when we restrict it to A \ X, we get restriction of p to A \ X. Since 
the ordering of ø is preserved in the ordering of T[W U(A\ X)], our modified 
goal now is as follows. 


Insert a maximum-sized subset of A into ordering o = (wj,..., Wg). 


Every vertex v € A has a “natural position” in ø, say p[v|, defined as follows. 
Since Reduction [FVST.1]is no longer applicable, for all v € A, we have that 
T[W U {v}] is acyclic and the position of v in ø is its position in the unique 
topological ordering of T[W U {v}]. Thus, there exists an integer p[v] such 
that for i < piv], there is an arc from w; to v and for all i > p[v] there is an 
arc from v to w;. Thus we get that 


(v, wi) € E(T) 4> i> pfo]. (4.1) 


Observe that p[v] is defined for all v € A, it is unique, and p[v] € {1,...,q+1}- 
We now construct an ordering 7 of A as follows: u is before v in 7 if and 
only if plu] < plv] or plu] = piv] and u is before v in the ordering p. That 
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is, in the ordering m we take iteratively the sets {v € A : plv] = i} for 
i = 1,2,...,q + 1 and, within each set, we order the vertices according to p, 
the topological order of T[A]. 


The main observation now is as follows: 


1. In the transitive tournament T — X, the topological ordering of 
T[A \ X] needs to be a restriction of 7, because T[W] remains in 
T — X and ø is the topological ordering of T[W]. 

2. On the other hand, the topological ordering of T[A \ X] needs to 
be a restriction of p, the topological ordering of T[A]. 


Consequently, it suffices to search for the longest common subsequence 
of 7 and p. 


We next prove a lemma which formalizes the intuition suggested above. 


Lemma 4.4. Let B C A. Then T|W U B] is acyclic if and only if the vertices 
of B form a common subsequence of p and T. 


Proof. By p|g and z|g we denote restrictions of p and x to B, respectively. 

For the forward direction, suppose that T|W U B] is acyclic. We show that 
ple = T|g and hence vertices of B form a common subsequence of p and 
ma. Targeting a contradiction, assume that there exist x,y € B such that x 
appears before y in p|g and y appears before x in z|g. Then (x,y) € E(T), 
and ply] < pla]. Moreover, if it was that plz] = ply], then the order of x 
and y in m would be determined by p. Thus we conclude that ply] < p[z]. 
By (4-1), we have that (y, wpiyj) € E(T) and (wp) £) € E(T). Because of 
the directed edges (x,y), (Y, Wpjy]), and (Wpjyj, £), we have that {x, y, wpiy} 
induces a directed triangle in T[W U B], a contradiction. 

Now we show the reverse direction of the proof. Assume that the vertices 
of B form a common subsequence of p and 7. In particular, this means that 
ple = |g. To show that T[W U B] is acyclic, by Lemma [4.3] it is sufficient 
to show that T[W U B] contains no directed triangles. Since T[W] and T[A] 
are acyclic and there are no directed triangles with exactly two vertices in 
W (Reduction [FVST.1}, it follows that there can only be directed triangles 
with exactly two vertices in B. Since p|z = |g, for all x,y € B with (x,y) € 
E(T), we have that pia] < ply]. Then by (4-1), there is no w; € W with 
(y,wi) E€ E(T) and (w;,x) € E(T). Hence there is no directed triangle in 
T|W U B], and thus it is acyclic. This completes the proof of the lemma. 














We need the following known fact about the longest common subsequence 
problem, whose proof we leave as Exercise [4.2] 


Lemma 4.5. A longest common subsequence of two sequences with p and q 
elements can be found in time O(pq). 
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We are ready to describe the algorithm now. 


Lemma 4.6. DISJOINT FEEDBACK VERTEX SET IN TOURNAMENTS is solv- 
able in polynomial time. 


Proof. Let (T, W, k) be an instance of the problem. We use a ee 
exhaustively. Let R be the set of vertices deleted by Reduction and 
let (I’, W, k’) be the reduced instance. 

By Lemma [4.4] the optimal way to make T” acyclic by vertex deletions is 
exactly the same as that to make sequences p and 7 equal by vertex deletions. 
Thus, to find an optimal vertex deletion set, we can find a longest common 
subsequence of p and m, which can be done in polynomial time by Lemma|4.5] 
Let B be the vertices of a longest common subsequence of p and m and let 
X := RU(V(T")\ B). If |X| > k; then (T, W, k) is a no-instance. Otherwise, 
X is the desired directed feedback vertex set of size at most k. 














Lemmas [4.1]and [4.2] combined with Lemma |4.6] give the following result. 


Theorem 4.7. FEEDBACK VERTEX SET IN TOURNAMENTS can be solved in 
time Enl, 


4.3 FEEDBACK VERTEX SET 


In this section we give an algorithm for the FEEDBACK VERTEX SET problem 
on undirected graphs using the method of iterative compression. Recall that 
X is a feedback vertex set of an undirected graph G if G — X is a forest. 
We give only the algorithms for the disjoint version of the problem. As was 
discussed in Section the existence of an algorithm with running time 
a*n°™ for the disjoint variant problem would yield that FEEDBACK VERTEX 
SET is solvable in time (1 + a)*'nOM, 

We start by defining DISJOINT FEEDBACK VERTEX SET. In this problem, 
as input we are given an undirected graph G, integer k and a feedback vertex 
set W in G of size k + 1. The objective is to find a feedback vertex set 
X CV(G)\W of size at most k, or correctly conclude that no such feedback 
vertex set exists. We first give an algorithm for DISJOINT FEEDBACK VERTEX 
SET running in time 44n, and then we provide a faster algorithm with 
running time y?n), where y = 1v5 < 1.6181 denotes the golden ratio. 
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4.8.1 First algorithm for DISJOINT FEEDBACK VERTEX 
SET 


Let (G,W,k) be an instance of DISJOINT FEEDBACK VERTEX SET and let 
H = G — W. We first give a few reduction rules that simplify the input 
instance. 


Reduction FVS*.1. Delete all the vertices of degree at most 1 in G. 


Reduction FVS*.2. If there exists a vertex v in H such that G[W U {v} 
contains a cycle, then include v in the solution, delete v and decrease the 
parameter by 1. That is, the new instance is (G — {v}, W, k — 1). 


Reduction FVS*.3. If there is a vertex v € V(H) of degree 2 in G such 
that at least one neighbor of v in G is from V(#), then delete this vertex 
and make its neighbors adjacent (even if they were adjacent before; the graph 
could become a multigraph now). 


It is easy to see that Reductions |FVS*.1| |FVS*.2| and |FVS*.3] are safe 


and that they produce an equivalent instance. Furthermore, all of them can 
be applied in polynomial time. Now we are ready to state the main lemma 
of this section. 


Lemma 4.8. DISJOINT FEEDBACK VERTEX SET is solvable in time 45n ® 


Proof. We give only an algorithm for the decision variant of the problem, i.e., 
we only verify whether a solution exists or not. It is straightforward to modify 
the algorithm so that it actually finds a solution, provided there exists one. 

We will follow a branching strategy with a nontrivial measure function. 
Let (G, W, k) be the input instance. If G[W] is not a forest then return that 
(G,W,k) is a no-instance. So from now onwards we assume that G[W] is 


indeed a forest. The algorithm first applies Reductions |FVS*.1| |FVS*.2|and 
FVS*.3] exhaustively. For clarity we denote the reduced instance (the one on 


which Reductions [FVS*.2]and [FVS*.3] do not apply) by (G, W, k). 
If k < 0, then return that (G, W, k) is a no-instance. 

From now onwards we assume that k > 0. Recall that H is a forest as W is 
a feedback vertex set. Thus H has a vertex x of degree at most 1. Furthermore, 
x has at least two neighbors in W, otherwise Reduction [FVS*.1] or 
would have been applied. Since Reduction [FVS*.2]cannot be applied, we have 
that no two neighbors of x belong to the same connected component of G[W]. 
Now branch by including x in the solution in one branch and excluding it in 
the other branch. That is, we call the algorithm on instances (G—{x}, W, k—1) 
and (G,W U {z}, k). If one of these branches returns a solution, then we 
conclude that (G, W, k) is a yes-instance, otherwise (G, W, k) is a no-instance. 
The correctness of this algorithm follows from the safeness of our reductions 
and the fact that the branching is exhaustive. 
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To estimate the running time of the algorithm, for instance I = (G, W, k), 

we define its measure 
wd) =k +7), 

where y(J) is the number of connected components of G[W]. Observe that 
Reductions and [FVS*.3] do not increase the measure. How 
does y(I) change when we branch? When we include x in the solution, k 
decreases by 1 and (J) remains the same, and thus (T) decreases by 1. In 
the other branch, k remains the same, while x has neighbors in at least two 
connected components of W. Hence, when we include x in W, (IT) drops 
by at least 1. Thus, we have a branching vector (1,1) and the running time 
of our branching algorithm is 2““)n°. Since at the beginning we have 
(lI) <k+|W| < 2k +1, we obtain the desired running time. 


Similarly to Lemmas [4.1]and [4.2] (see also the discussion in Section |4.1.1), 


one can use iterative compression to show that the a*n°(-time algorithm for 
DISJOINT FEEDBACK VERTEX SET can be used to solve FEEDBACK VERTEX 
SET in time (1+ a)*nO™. Hence, Lemma|4.8]implies the following theorem. 














Theorem 4.9. FEEDBACK VERTEX SET can be solved in time 5*'n@). 


*4.3.2 Faster algorithm for DISJOINT FEEDBACK VERTEX 
SET 


Now we show how to modify the algorithm of Theorem [4.9]to obtain “almost” 
the fastest known deterministic algorithm for FEEDBACK VERTEX SET. 

We start with a few definitions. Let (G,W,k) be an instance of DISJOINT 
FEEDBACK VERTEX SET and H = G—W. A vertex v € V (H) is called a tent 
if its degree in G is 3 and all its neighbors are in W. In other words, v has 
degree 3 in G and it is an isolated vertex in H. We call a vertex v € V(H) 
nice if its degree in G is 2 and both its neighbors are in W. The following 
lemma explains why we are interested in nice vertices and tents. 


Lemma 4.10. Let (G, W, k) be an instance of DISJOINT FEEDBACK VERTEX 
SET such that every vertex from V (H) is either nice or a tent. Then DISJOINT 
FEEDBACK VERTEX SET on (G,W,k) can be solved in polynomial time. 


The proof of Lemma [4.10] is based on a polynomial-time algorithm for the 
“matroid parity” problem for graphic matroids. We defer the proof of this 
Lemma to Section [12.2.2] and here we use it as a black box to obtain the 
desired algorithm for FEEDBACK VERTEX SET. 

For the new algorithm we also need one more reduction rule apart from 


Reductions and |FVS*.3 


Reduction FVS*.4. Let v € V(H) be a vertex of degree 3 in G, with one 
neighbor u in V(#) and two neighbors in W. In particular, v is a leaf in H. 
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Fig. 4.1: Reduction |FVS*.4 


Subdivide the edge vu and move the newly introduced vertex, say w, to W. 
Let the new graph be named G’ and W” = W U {w}. The new instance is 


(G', W', k). See Fig. [4.1] 

The safeness of Reduction [FVS*.4] follows from the fact that introducing 
an undeletable degree-2 vertex in the middle of an edge does not change the 
set of feasible solutions to DISJOINT FEEDBACK VERTEX SET. Let us also 
observe that even though Reduction[FVS*.4]increases the number of vertices, 
it also increases the number of tents. Furthermore, as we never add vertices 
to H, this rule can be applied at most |V(H)| times. 

Next we define the new measure which is used to estimate the running 
time of the new algorithm for DISJOINT FEEDBACK VERTEX SET. With an 
instance I = (G,W,k), we associate the measure 


wd) =k +77) — 72). 


Here, y(Z) denotes the number of connected components of G[W] and r(J) 
denotes the number of tents in G. 
Next we show that our reduction rules do not increase the measure. 


Lemma 4.11. An application of any of the Reductions |F'VS*.1\ |F'VS*.2 
FVS*.3| and|F'VS*.4| does not increase u(T). 


Proof. The proofs for Reductions |FVS*.1| |FVS*.2] and |FVS*.3] are straight- 
forward. As for Reduction |FVS*.4| let I = (G,W,k) be the considered in- 


stance. We add vertex w to W thus creating a new connected component and 
increasing y by 1. We also create at least one tent at the vertex v (the vertex 
u may also become a tent), see Fig. Thus we increase 7 by at least 1. 
Thus, for the new instance J’ we have that u(I’) < p(T). 














The important thing we have to keep in mind is that: 


For a branching algorithm it is necessary that in all instances whose 
measure is nonpositive, it should be possible to solve the problem in 
polynomial time. 
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So let us now verify that this is the case for DISJOINT FEEDBACK VERTEX 
SET. The next lemma implies that if for an instance I we have r(I) > k+74(J), 
that is, if the measure of I is nonpositive, then J is a no-instance. In fact, we 
prove a slightly stronger statement. 


Lemma 4.12. Let I = (G,W,k) be an instance of DISJOINT FEEDBACK 
VERTEX SET. If TI) > k+ a then I is a no-instance. 


Proof. Suppose, for contradiction, that I is a yes-instance of the problem 
and let X be a feedback vertex set of size at most k disjoint from W. Thus 
G’ = G — X is a forest. Let T C V(G) \ W be the set of tents in G. Now 
consider G’'[W U (T \ X)]. Clearly, G’[W U (T \ X)] is also a forest. Now in 
G' |W U (T \ X)] we contract each connected component of G[W] into a single 
vertex and obtain a forest, say F. Then F has at most (I) +|T \ X| vertices, 
and hence has at most 7(I)+|T'\ X|—1 edges. However, the vertices of T\ X 
form an independent set in F (as they are tents), and each of them is of 
degree exactly three in F: the degree could not drop during the contraction 
step, since we just contracted some subtrees of the forest G’[W U (T \ X)]. 
Thus 3|T \ X| < y(1) +|T \ X|, and hence 2|T \ X| < (I). As |X| < k, we 


have that T(I) = |T| < k + 1D, This concludes the proof. 














Now we are ready to state the main technical lemma of this section. 


Lemma 4.13. DISJOINT FEEDBACK VERTEX SET can be solved in time 
yn, where p < 1.6181 is the golden ratio. 


Proof. Again, we give an algorithm only for the decision version of the prob- 
lem, and turning it into a constructive algorithm is straightforward. 

We follow a similar branching strategy, but this time we shall use our 
adjusted measure p(I) that takes into account also tents. Let (G, W, k) be 
the input instance. If G[W] is not a forest, then return that (G, W, k) is a 
no-instance. So from now onwards we assume that G[W] is indeed a forest. 


The algorithm first applies Reductions |FVS*.1||FVS*.2] |F VS*.3|and [F V S*.4 


exhaustively. For clarity we denote the reduced instance (the one on which 
Reductions [FVS* 1] [FVS*.3]and[FVS*.4]do not apply) by (G, W, k). 
If w(Z) < 0 then return that (G,W,k) is a no-instance. The correctness of 
this step follows from the fact that if (I) < 0 then 7(I) > k+ (J), and thus 
by Lemma [4.12] we have that the given instance is a no-instance. 

From now onwards we assume that (I) > 0. We now check whether every 
vertex in V(G) \ W is either nice or a tent. If this is the case, we apply 
Lemma|4.10]and solve the problem in polynomial time. Otherwise, we move 
to the branching step of the algorithm. The algorithm has only one branching 
rule which is described below. 


Pick a vertex x that is not a tent and has the maximum possible number 
of neighbors in W. 
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Now we branch by including x in the solution in one branch, and excluding 
it in the other branch. That is we call the algorithm on instances (G — 
{xz}, W, k — 1) and (G, W U {a}, k). If one of these branches is a yes-instance, 
then (G,W,k) is a yes-instance. Otherwise (G,W,k) is a no-instance. The 
correctness of this algorithm follows from the safeness of reductions and the 
fact that the branching is exhaustive. 

To compute the running time of the algorithm, let us see how measure 
u(I) = k+7(I)—7(L) changes. By Lemma we know that applications of 
Reductions[FVS*.1][FVS*.2|[FVS*.3]and do not increase the measure. 
Now we will see how p(I) changes when we branch. In the branch where we 
include z in the solution, we have that k decreases by 1, the number of tents 
does not decrease, and the number of connected components of G[W] remains 
the same. Thus, (I) decreases by 1. 

Now let us consider the other branch, where z is included into W. In this 
branch, k remains the same and 7(I) does not decrease. Since Lemma [4.10] 
is not applicable, we know that there exists a connected component of H 
that is not just a single tent, and pa exists a vertex u in this component 
whose degree in H is at most one (since H is a forest). Furthermore, since 
Reductions [FVS*.1] [FVS*.2| [FVS*.3] FV S*.3] and [FVS*.4) [FVS* 4] are not applicable, vertex 
u has at least three neighbors in W. Since in the branching algorithm we 
chose a vertex x that is not a tent and has the maximum possible number 
of neighbors in W, we have that x also has at least three neighbors in W. 
Since Reduction [FVS*.2] is not applicable, when we include z to W, graph 
G[W U {«}] remains a forest and its number of connected components y(J) 
drops by at least 2. Thus, (T) drops by at least 2. 

The last two paragraphs show that the algorithm has a branching vector 
of (1,2). Hence, the algorithm solves DISJOINT FEEDBACK VERTEX SET in 
time p(n. Since initially we have that (I) < k+|W| < 2k +1, we 
obtain the claimed running time. 














Pipelined with the iterative compression framework (see the discussion in 
Section |4.1.1), Lemma implies the following theorem. 


Theorem 4.14. FEEDBACK VERTEX SET can be solved in time 


(1+ y7)FnO = 3.6181'§nO™). 


4.4 ODD CYCLE TRANSVERSAL 


In this section we give an algorithm for ODD CYCLE TRANSVERSAL using the 
method of iterative compression. Recall that X is an odd cycle transversal 
of G if graph G — X is bipartite. Again, we follow the same generic steps as 
described in Section [4.1.1 


92 4 Iterative compression 


We start by defining DISJOINT ODD CYCLE TRANSVERSAL. In this prob- 
lem, on the input we are given an undirected graph G, an odd cycle transversal 
W of size k+1 and a positive integer k. The objective is to find an odd cycle 
transversal X C V(G) \ W of size at most k, or to conclude that no such 
odd cycle transversal exists. We give an algorithm for DISJOINT ODD CYCLE 
TRANSVERSAL running in time 2'n°@), 

Our algorithm for DISJOINT ODD CYCLE TRANSVERSAL will use the fol- 
lowing annotated problem as a subroutine. In the ANNOTATED BIPARTITE 
COLORING problem, we are given a bipartite graph G, two sets B1, B2 C 
V(G), and an integer k, and the goal is to find a set X consisting of at most 
k vertices, such that G — X has a proper 2-coloring f : V(G) \ X > {1,2} 
(i.e., f(u) A f(v) for every edge uv) that agrees with the sets Bı and Ba, 
that is, f(v) = i whenever v € B; \ X and i = 1,2. 


An algorithm for ANNOTATED BIPARTITE COLORING. Let (G, Bi, B2, k) 
be an instance of ANNOTATED BIPARTITE COLORING. We can view vertices 
of Bı and Bə as precolored vertices. We do not assume that this precoloring 
is proper, that is, a pair of adjacent vertices can be colored with the same 
color. Moreover, we do not assume that Bı and Bə are disjoint, thus some 
vertices can have both colors. We want to find a set X of size at most k 
such that in graph G — X there is a proper 2-coloring extending precolored 
vertices. To find such a coloring we proceed as follows. We fix an arbitrary 
proper 2-coloring f* of G, f* : V(G) — {1,2}. Clearly, such a coloring exists 
as G is a bipartite graph. Let B* = (f*)~1(i) for i = 1,2. The objective is 
to find a set S of at most k vertices such that G— S has another 2-coloring 
f such that B; \ X is colored i for i = 1,2. Observe that each vertex of 
C := (Bı BS) U (B2 N BY) should be either included in X, or have different, 
colors with respect to f* and f. That is, for every v € C \ X, it holds that 
f*(v) Æ f(v), i.e., the vertices of C \ X must change their colors. Similarly, 
each vertex of R := (Bı N BY) U (B2M Bž) that is not included in X should 
keep its color. Thus, for every v € R it holds that f*(v) = f(v), unless 
v E€ X. See the diagram on Fig. The following lemma helps us in solving 
the annotated problem. 


Lemma 4.15. Let G be a bipartite graph and f* be an arbitrary proper 2- 
coloring of G. Then set X is a solution for ANNOTATED BIPARTITE COL- 
ORING if and only if X separates C and R, i.e., no component of G — X 
contains vertices from both C \ X and R\ X. Furthermore, such a set X of 
size k (provided it exists) can be found in time O(k(n +m)). 


Proof. We first prove the forward direction of the proof. In a proper 2-coloring 
of G — X, say f, each vertex of V (G) \ X either keeps the same color (f(v) = 
f*(v)) or changes its color (f(v) 4 f*(v)). Moreover, two adjacent vertices 
have to pick the same decision. Therefore, for every connected component 
of G \ X, either all vertices v satisfy f(v) = f*(v) or all vertices v satisfy 
f(v) 4 f*(v). Consequently, no connected component of G \ X may contain 
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Fig. 4.2: Sets C and R 


a vertex of both R (which needs to keep its color) and C (which needs to 
change its color). 

For the backward direction, let X be a set separating R and C. Define the 
coloring f : V(G)\ X > {1,2} as follows: first set f = f* and then flip the 
coloring of those components of G — X that contain at least one vertex of 
C \ X. No vertex of R is flipped and thus this is the required 2-coloring. 

To find a vertex set of size at most k separating C and R, one can use 
the classic max-flow min-cut techniques, e.g., by k iterations of the Ford- 
Fulkerson algorithm (see Theorem|8.2). Thus we obtain the promised running 
time bound. 














Back to ODD CYCLE TRANSVERSAL. Now we are going to use our algorithm 
for ANNOTATED BIPARTITE COLORING to solve ODD CYCLE TRANSVERSAL 
by applying the iterative compression framework. As usual, to this end we 
first give an algorithm for the disjoint variant of the problem, DISJOINT ODD 
CYCLE TRANSVERSAL. Recall that here we are given a graph G, an integer 
k, and a set W of k+1 vertices such that G—W is bipartite. The objective is 
to find a set X C V(G)\W of at most k vertices such that G—X is bipartite, 
that is, admits a proper 2-coloring f, or to conclude that no such set exists. 
Towards this we do as follows. Every vertex v € W remains in G — X and 
hence G[W] has to be bipartite, as otherwise we are clearly dealing with a 
no-instance. The idea is to guess the bipartition of W in G — X. 

That is, we iterate over all proper 2-colorings fw : W — {1,2} of G[W] 
and we look for a set X C V(G) \ W such that G — X admits a proper 
2-coloring that extends fy. Note that there are at most 2IW1 = 2*+1 choices 
for the coloring fy. Let BW = fiy (i) for i = 1,2. 

Observe that every vertex v in By := Ne(BW) A (V (G) \ W) needs to be 
either colored 2 by f, or deleted (included in X). Similarly, every vertex v in 
Bı := Ne(BS) A (V (G) \ W) needs to be either colored 1 by f, or deleted. 
Consequently, the task of finding the set X and the proper 2-coloring f 
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that extends fw reduces to solving an ANNOTATED BIPARTITE COLORING 
instance (G — W, B1, B2, k). 

By Lemma[4.15] this instance can be solved in O(k(n+m)) time and hence 
the total running time for solving DISJOINT ODD CYCLE TRANSVERSAL is 
O(2* . k(n +m)). Thus we obtain the following lemma. 


Lemma 4.16. DISJOINT ODD CYCLE TRANSVERSAL can be solved in time 
O(2* -k(n +m)). 





Now Lemma/4.16} together with the iterative compression approach, yields 
the following theorem. 


Theorem 4.17. ODD CYCLE TRANSVERSAL is solvable in time O(3*-kn(n+ 


m)). 


Exercises 


4.1 (Z). Prove Lemma|4.3] 
4.2. Prove Lemma 


4.3. Obtain an algorithm for 3-H1TTING SET running in time 2.4656% nO) using iterative 
compression. Generalize this algorithm to obtain an algorithm for d-HiTT1NG SET running 
in time ((d — 1) + 0.4656) nO ©. 


4.4. An undirected graph G is called perfect if for every induced subgraph H of G, the size 
of the largest clique in H is the same as the chromatic number of H. We consider the Opp 
CYCLE TRANSVERSAL problem, restricted to perfect graphs. 

Recall that Exercise asked for a kernel with O(k) vertices, whereas erene el 
asked for a 3°n©()-time branching algorithm for this problem. Here we ask for a 2*nOW)- 
time algorithm based on iterative compression. 


4.5 (8). In the CLUSTER VERTEX DELETION problem, we are given a graph G and an 
integer k, and the task is to find a set X of at most k vertices of G such that G — X isa 
cluster graph (a disjoint union of cliques). Using iterative compression, obtain an algorithm 
for CLUSTER VERTEX DELETION running in time 2*nO@), 


4.6. A graph G is a split graph if V(G) can be partitioned into sets C and I, such that C 
is a clique and J is an independent set. In the SPLIT VERTEX DELETION problem, given a 
graph G and an integer k, the task is to check if one can delete at most k vertices from G 
to obtain a split graph. 

Recall that Exercise [3.20] asked for a 4*n©()-time algorithm for this problem, via a 
reduction to VERTEX Cover ABovE Matcuina. Here we ask for a 2*n©()-time algorithm 
for this problem using iterative compression. 


4.7 (È). A set X C V(G) of an undirected graph G is called an independent feedback 
vertex set if G[X] is independent and G — X is acyclic. In the INDEPENDENT FEEDBACK 
VERTEX SET problem, we are given as input a graph G and a positive integer k, and the 
objective is to test whether there exists in G an independent feedback vertex set of size 
at most k. Show that INDEPENDENT FEEDBACK VERTEX SET is in FPT by obtaining an 
algorithm with running time Bn OM), 
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4.8. Obtain a polynomial kernel for DISJOINT FEEDBACK VERTEX SET. Improve it to a 
kernel with O(k) vertices. 


4.9 ($). The Enae BIPARTIZATION problem is the edge-deletion variant of Opp CYCLE 
TRANSVERSAL: one is given a graph G and an integer k, and the task is to remove at most 
k edges from G in order to make G bipartite. Give a 2* -n©()-time algorithm for EDGE 
BIPARTIZATION using iterative compression. 


In the next few exercises we study the VARIABLE DELETION ALMOST 2-SAT problem: 
given a Boolean formula y in CNF, with every clause containing at most two literals, and 
an integer k, the task is to delete at most k variables from y to make it satisfiable; a 
variable is deleted together with all clauses containing it. It is sometimes useful to think 
of deleting a variable as setting it to both true and false at once, so that it satisfies all the 
clauses where it appears. 

In Theorem [3.12] we have shown an FPT algorithm for this problem, by a reduction 
to VERTEX Cover ABove MATCHING. Furthermore, Exercises [3.21] and [3.22] asked you 
to prove that VARIABLE DELETION ALMOST 2-SAT is equivalent to the (more standard) 
clause-deletion variant called simply ALMostT 2-SAT, where we are allowed only to delete 
k clauses. Here, we study relations between ODD CYCLE TRANSVERSAL and VARIABLE 
DELETION ALMOST 2-SAT, and develop a different FPT algorithm for VARIABLE DELE- 
TION ALMOST 2-SAT, based on iterative compression. 


4.10. Express a given ODD CYCLE TRANSVERSAL instance (G,k) as an instance (y, k) of 
VARIABLE DELETION ALMOST 2-SAT. 


4.11. Consider the following annotated variant of VARIABLE DELETION ALMOST 2-SAT, 
which we call ANNOTATED SATISFIABLE ALMOST 2-SAT: Given a satisfiable formula y, 
two sets of variables V' and V+, and an integer k, the task is to check if there exists a set 
X of at most k variables of y and a satisfying assignment w of y — X such that y(x) = T 
for every x € V! \ X and w(x) = L for every x € V+ \ X. Assuming that there exists 
a c’nO()-time FPT algorithm for ANNOTATED SaTISFIABLE ALMOST 2-SAT for some 
constant c > 1, show that VARIABLE DELETION ALMOST 2-SAT admits a (1 + 2c)*nO(- 
time algorithm. 


4.12. Express a given ANNOTATED BIPARTITE COLORING instance (G, Bi, B2,k) as an 
instance (y,V',V+,k) of ANNOTATED SATISFIABLE ALMOST 2-SAT. 


4.13. Consider the following operation of flipping a variable x in a VARIABLE DELETION 
ALMOST 2-SAT or an ANNOTATED SATISFIABLE ALMOST 2-SAT instance: we replace 
every literal x with ~z and every literal sx with x. Furthermore, in the case of ANNOTATED 
SATISFIABLE ALMOST 2-SAT, if x € V® for some a € {T, L}, then we move x to V™®. 

Using the flipping operation, show that it suffices to consider only ANNOTATED SATIS- 
FIABLE ALMOST 2-SAT instances where an assignment that assigns false to every variable 
satisfies the input formula y. That is, there are no clauses of the form z V y. 


4.14 (È). Consider the following vertex-deletion variant of the DigRapu Pair Cur prob- 
lem with a sink: given a directed graph G, designated vertices s,t € V(G), a family of pairs 
of vertices F C (re). and an integer k, check if there exists a set X C V(G)\{s, t} of size 
at most k such that t is unreachable from s in G— X and every pair in F either contains 
a vertex of X or contains a vertex that is unreachable from s in G— X. Exercise asks 
you to design a 2*n©()-time algorithm for this problem. 


1. Deduce from Exercise [4.13] that, when considering the ANNOTATED SATISFIABLE AL- 
MOST 2-SAT problem, one can consider only instances where all the clauses are of 
the form z > y or =z V my. (Remember unary clauses.) 

2. Use this insight to design a 2knO)-time algorithm for ANNOTATED SATISFIABLE 
ALMOST 2-SAT, using the aforementioned algorithm of Exercise]8.19]as a subroutine. 

3. Conclude that such a result, together with a solution to See nie a 5knOW- 
time algorithm for VARIABLE DELETION ALMOST 2-SAT. 
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Hints 


This problem is a standard example for dynamic programming. Let wp and wq be two 
sequences, of length p and q, respectively. For every 0 < i < p and 0 < j < q, we define 
T[i, j] to be the length of the longest common subsequence of substrings w,[1,...,i] and 
wg[1,..., j]. Then, T[i, j] equals the maximum of: T[i—1, j], T[i, j— 1] and, if wp[é] = walij], 
1+Tļi-1,j-— 1]. 


In the case of 3-HITTING SET, observe that the disjoint version of the problem is 
essentially a VERTEX COVER instance, as every set of size 3 contains at least one undeletable 
element. Then, use the algorithm of Theorem In the general case, generalize this 
observation: the disjoint version of the problem is also a d-HiTTING SET instance, but 
every set lost at least one element. 


Reduce the disjoint version of the problem to a minimum vertex cover problem on an 
auxiliary bipartite graph. Some insight from Exercise|4.3]may help you. 


The exercise boils down to a task of showing that the disjoint version of the problem 
is polynomial-time solvable. 

First, observe that a graph is a cluster graph if and only if it does not have an induced 
path on three vertices. In the disjoint version of the problem, given (G,S,k) such that 
|S| =k+1 and G — S is a cluster graph, we are looking for a solution of size at most k 
such that it is disjoint from S. 


We can assume that the subgraph induced on S is a cluster graph. Why? 

What can you do if there is a vertex in V(G) \ S that is adjacent to two of the clusters 
of G[S], the subgraph induced on S? Or a vertex in V(G) \ S that is adjacent to some, 
but not all vertices of one of the clusters of G[S]? 

e Now we can assume that every vertex in V(G)\S is either completely nonadjacent to S, 
or completely adjacent to exactly one cluster of G[S]. Argue that the resulting problem 
(of finding a subset of vertices to be deleted from V (G) \ S to make the resulting graph 
a cluster graph) can be solved by using an algorithm for finding a maximum matching 
of minimum cost in a bipartite graph. 


Again, the task reduces to showing that the disjoint version of the problem is 
polynomial-time solvable. 

The main observation is that a split graph on n vertices has at most n? partitions of 
the vertex set into the clique and independent set part: for a fixed one partition, at most 
one vertex can be moved from the clique side to the independent one, and at most one 
vertex can be moved in the opposite direction. (In fact one can show that it has at most 
O(n) split partitions.) Hence, you can afford guessing the “correct” partition of both the 
undeletable and the deletable part of the instance at hand. 


In short, mimic the 3.619*n°)-time algorithm for FEEDBACK VERTEX SET. 

That is, design a 4*n©°(@)-time algorithm for the disjoint version of the problem in 
the following manner. Assume we are given an instance (G, W, k) of the following disjoint 
version of INDEPENDENT FEEDBACK VERTEX SET: G — W is a forest, |W| = k + 1, and we 
seek an independent feedback vertex set X of G that is of size at most k and is disjoint 
from W. Note that we do not insist that G[W] be independent. 

A tent is a vertex v ¢ W, of degree 2, with both neighbors in W. For an instance 
I = (G,W,k), we let 7(I) be the number of tents, y(I) be the number of connected 
components of G[W], and u(I) = k + y(1) — T(J) be the potential of J. Proceed with the 
following steps. 


1. Adapt the reduction rules that deal with vertices of degree 1 and with vertices v ¢ 
W that have more than one incident edge connecting them to the same connected 
component of G[W]. 
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2. Design a reduction that deals with vertices v ¢ W of degree 2 that have exactly one 
neighbor in W. Note that you cannot simply move them to W, due to the condition 
of solutions being independent. Instead, proceed similarly as we did in the FEEDBACK 
VERTEX SET algorithm for vertices of degree 3, with exactly two neighbors in W. 

3. Finally, prove that the potential is positive on a yes-instance by adapting the proof of 
Lemma |4.12 


To perform the iterative compression step, observe that picking one endpoint of every 
edge of a solution to EDGE BIPARTIZATION yields an odd cycle transversal of the graph. Use 
this observation to reduce the problem to designing a compression routine for the following 
task: you are given a graph G with an odd cycle transversal W of size k + 1, and the task 
is to find a solution to EDGE BIPARTIZATION of size at most k, or to conclude that no such 
solution exists. Branch into 2¥+1 subproblems, guessing the right bipartition of W in the 
bipartite graph after removing the edges of the solution. Then reduce verifying existence of 
a solution in a branch to an auxiliary polynomial-time solvable cut problem, using similar 
ideas as with sets C and R in the algorithm for ODD CycLE TRANSVERSAL. 


Apply the reduction rules given for the disjoint version of the problem. Argue that if 
there are too many vertices (like KO“) or O(k)) of degree at most 2 in the deletable part, 
then the given instance is a no-instance. Else, use a standard tree-based argument to show 
that the instance is small. 


Treat every vertex as a variable. For every edge ry € E(G), introduce clauses Vy and 
ax V ay. This concludes the construction of the formula y. To show correctness, observe 
that the clauses for an edge xy force x and y to attain different values in a satisfying 
assignment. 


Using iterative compression, the task boils down to designing a (2c)*nO@)-time 
algorithm for the disjoint version of VARIABLE DELETION ALMosT 2-SAT. Thus, we have 
an instance (y,W,k), where W is a set of k + 1 variables from y such that p — W is 
satisfiable, and we seek a solution X to VARIABLE DELETION ALMOST 2-SAT on (,k) 
that is disjoint from W. 

At the cost of 2*+1 subcases, we guess the values of variables of W in some satisfying 
assignment of y — X. Now, we can do the following cleaning: discard all clauses satisfied 
by the guessed assignment, and discard the current branch if there exists a clause with all 
literals evaluated to false by the guessed assignment. Observe that every remaining clause 
C = (41 V é2) that contains a literal with a variable from W (say, ¢1) satisfies the following: 
£ı is evaluated to false by the guessed assignment, and the second literal Z2 contains a 
variable not in W. For every such clause C, we put the variable of l2 into VT if é is a 
positive literal, and otherwise we put the variable of Z> into V+. Argue that it remains to 
solve an ANNOTATED SATISFIABLE ALMOST 2-SAT instance (y — W,V',V+,k). 


Proceed as in Exercise [4.10 


The first and the last point are straightforward. Here we elaborate on the second 
point. 

We express the task of solving the input instance (y,V',V+,k) of ANNOTATED SAT- 
ISFIABLE ALMOST 2-SAT as a vertex-deletion DiGRAPH Parr Cur with a sink instance 
(G,s,t,F,k) as follows. 


Start with vertices s and t. 

For every variable x, introduce a single vertex x. 

For every clause « => y (i.e., =x V y), introduce an edge (x,y). 
For every clause ~z V ~y, introduce a pair {x,y} to the family F. 
For every vertex y € V', we introduce an edge (s, y). 

For every vertex y E€ V~, we introduce an edge (y, t). 





Ce ee Nr 
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Argue that solutions to the aforementioned node-deletion DigRAPH Parr Curt instance 
(G,s,F,k) are in one-to-one correspondence with the solutions to the original instance of 
ANNOTATED SATISFIABLE ALMOST 2-SAT. The intuition is that the nodes x reachable 
from s after the solution is removed correspond to the variables that are set to true in a 
satisfying assignment. 
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The idea of iterative compression was first introduced by Reed, Smith, and Vetta 
to prove that Opp CYCLE TRANSVERSAL is fixed-parameter tractable. The technique has 
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VERTEX SET and ALmost 2-SAT [394]. 

The algorithm for FEEDBACK VERTEX SET IN TOURNAMENTS of Theorem [4.7]is due to 
Dom, Guo, Hüffner, Niedermeier, and Truß [140]. For the FEEDBACK VERTEX SET problem, 
the parametric dependency in the running time has been systematically improved over the 
years [393] [241] [127}[77] [70], resulting in the current best deterministic algorithm running in 
time 3.592*n©@) [301]. If we allow randomization, then the problem admits an algorithm 
with running time 3% nO — we will actually see this algorithm in Section|11.2.1). 
Our simpler 5*n©°()-time algorithm for FEEDBACK VERTEX SET follows the work of Chen, 
Fomin, Liu, Lu, and Villanger [77]. The description of the 3.619*n©°™) algorithm follows 
Kociumaka and Pilipezuk [301]. The observation that DISJOINT FEEDBACK VERTEX SET 
instance becomes polynomial-time solvable if every deletable vertex is nice or a tent is due 
to Cao, Chen, and Liu [70]. 

The algorithm for CLUSTER VERTEX DELETION from Exercise|4.5]originates in the work 
of Hiiffner, Komusiewicz, Moser, and Niedermeier [271]. A 2% -n°@) algorithm for EDGE 
BIPARTIZATION was first given by Guo, Gramm, Hiiffner, Niedermeier, and Wernicke [241]; 
their algorithm is actually quite different from the one sketched in the hint to Exercise|4.9 
The parameterized complexity of ALMosT 2-SAT was open for some time until Razgon 
and O’Sullivan gave a 15*n°@) algorithm. The current fastest algorithm for this problem 
runs in time 2.315*n°() [828]. The algorithm for ALMost 2-SAT from Exercise 
originates in the work of Kratsch and Wahlstrém [310]; they gave a polynomial kernel for 
the problem, and the algorithm follows from their ideas implicitly. 


In this book we will again use iterative compression to solve PLANAR VERTEX DELETION 
in Chapter [7] and DIRECTED FEEDBACK VERTEX SET in Chapter [8] 


Chapter 5 


Randomized methods in parameterized 
algorithms 


In this chapter, we study how the powerful paradigm 
of randomization can help in designing FPT algo- 
rithms. We introduce the general techniques of color 
coding, divide and color, and chromatic coding, and 
show the use of these techniques on the problems 
LONGEST PATH and d-CLUSTERING. We discuss also 
how these randomized algorithms can be derandomized 
using standard techniques. 





We give a few commonly used approaches to design randomized FPT al- 
gorithms. 

In Section [5.1] we start with a simple example of an algorithm for FEED- 
BACK VERTEX SET. Here, we essentially observe that, after applying a few 
easy reduction rules, a large fraction of the edges of the graph needs to be 
adjacent to the solution vertices — and, consequently, taking into the solu- 
tion a randomly chosen endpoint of a randomly chosen edge leads to a good 
success probability. 

Then, in Section we move to the very successful technique of color 
coding, where one randomly colors a universe (e.g., the vertex set of the in- 
put graph) with a carefully chosen number of colors and argue that, with 
sufficient probability, a solution we are looking for is somehow “properly col- 
ored”. The problem-dependant notion of “properly colored” helps us resolve 
the problem if we only look for properly colored solutions. For resolving the 
colored version of the problem, we use basic tools learned in the previous 
chapters, such as bounded search trees. In this section we start with the clas- 
sic example of a color coding algorithm for LONGEST PATH. Then, we show 
how a random partition (i.e., a random coloring with two colors) can be used 
to highlight a solution, using an example of the SUBGRAPH ISOMORPHISM 
problem in bounded degree graphs. Moreover, we present the so-called divide 
and color technique for LONGEST PATH, where a recursive random partition- 
ing scheme allows us to obtain a better running time than the simple color 
coding approach. We conclude this section with a more advanced example of 
a chromatic coding approach for d-CLUSTERING. 
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Finally, in Section we briefly discuss the methods of derandomizing 
algorithms based on color coding. We recall the necessary results on con- 
structing pseudorandom objects such as splitters, perfect hash families, uni- 
versal sets and small k-wise independent sample spaces, and show how to 
replace the random coloring step of (variants of) the color coding approach 
by iterating over a carefully chosen pseudorandom object. 

Let us briefly recall the principles of randomization in algorithms. We 
assume that the algorithm is given access, apart from the input, to a stream 
of random bits. If the algorithm reads at most r random bits on the given 
input, then the probability space is the set of all 2” possible strings of random 
bits read by the algorithm, with uniform probability distribution (i.e., each 
bit is chosen independently and uniformly at random). Whenever we say 
that “an algorithm does X with probability at least /at most p’, we mean the 
probability measured in this probability space. 

Consider an algorithm for a decision problem which given a no-instance 
always returns “no,” and given a yes-instance returns “yes” with probability 

€ (0,1). Such an algorithm is called a one-sided error Monte Carlo algorithm 
with false negatives. 

Sometimes we would like to improve the success probability p, especially 
in our applications where we often have bounds like p = 1/20), or, more 
generally, p = 1/ f(k) for some computable function f. It is not difficult to 
see that we can improve the success probability at the price of worse running 
time. Namely, we get a new algorithm by repeating the original algorithm t 
times and returning “no” only if the original algorithm returned “no” in each of 
the t repetitions. Clearly, given a no-instance the new algorithm still returns 
“no.” However, given a yes-instance, it returns “no” only if all t repetitions 
returned an incorrect “no” answer, which has probability at most 


(1 = p) < (eY = 1/e™, 


where we used the well-known inequality 1 + x < e”. It follows that the new 
success probability is at least 1 — 1/e?. Note that in particular it suffices to 
put t= [5] to get constant success probabilityf | 


If a one-sided error Monte Carlo algorithm has success probability at 
least p, then repeating it independently I times gives constant success 
probability. In particular, if p = 1/f(k) for some computable function 
f, then we get an FPT one-sided error Monte Carlo algorithm with 
additional f(k) overhead in the running time bound. 





1 In this chapter, by “constant success probability” or “constant error probability’ we mean 
that the success probability of an algorithm is lower bounded by a positive universal con- 
stant (or, equivalently, the error probability is upper bounded by some universal constant 
strictly smaller than 1). 
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In the area of polynomial-time algorithms, quite often we develop an algo- 
rithm that does something useful (e.g., solves the problem) with probability at 
least p, where 1/p is bounded polynomially in the input size. If this is the case, 
then we can repeat the algorithm a polynomial number of times, obtaining 
a constant error probability. In the FPT world, the same principle applies; 
however, now the threshold of “useful probability” becomes 1/(f(k)nO™). 
That is, if we develop an algorithm that runs in FPT time and solves the 
problem with probability at least 1/(f(k)nO), then repeating the algorithm 
f(k)n°® times gives us constant error probability, while still maintaining 
an FPT running time bound. This is the goal of most of the algorithms in 
this chapter. 


5.1 A simple randomized algorithm for FEEDBACK 
VERTEX SET 


In this section, we design a simple randomized algorithm for FEEDBACK VER- 
TEX SET. As discussed in Section 3.3] when tackling with FEEDBACK VER- 
TEX SET, it is more convenient to work with multigraphs, not only simple 
graphs. Recall that both a double edge and a loop are cycles, and we use the 
convention that a loop at a vertex v contributes 2 to the degree of v. 

The following crucial lemma observes that, once we apply to the input 
instance basic reduction rules that deal with low-degree vertices, many edges 
of the graph are incident to the solution vertices. 


Lemma 5.1. Let G be a multigraph on n vertices, with minimum degree at 
least 3. Then, for every feedback vertex set X of G, more than half of the 
edges of G have at least one endpoint in X. 


Proof. Let H = G — X. Since every edge in E(G) \ E(H) is incident to a 
vertex in X, the claim of the lemma is equivalent to |E(G)\ E(H)| > |E(A)|. 
However, since H is a forest, |V(H)| > |E(H)| and it suffices to show that 
\E(G) \ E(H)| > |V(H)]. 

Let J denote the set of edges with one endpoint in X and the other in 
V(#). Let V<ı, V2 and V> denote the set of vertices in V (H) such that they 
have degree at most 1, exactly 2, and at least 3 in H, respectively. (Note that 
we use here the degree of a vertex in the forest H, not in the input graph G.) 
Since G has minimum degree at least 3, every vertex in V<; contributes at 
least two distinct edges to J. Similarly, each vertex in V2 contributes at least 
one edge to J. As H is a forest, we have also that |Vs3| < |V<j|. Putting all 
these bounds together, we obtain: 


102 5 Randomized methods in parameterized algorithms 


|E(G) \ E(H)| > |J] 
2 2|Vei| + [V2] > [Vea] + [V2] + [Vəsl 
= |V(H)]|. 


This concludes the proof of the lemma. 














Recall that in Section we have developed the simple reduction rules 
[FVS.IHFVS.5]that reduce all vertices of degree at most 2 in the input graph. 
Lemma|5.1|says that, once such a reduction has been performed, a majority 
of the edges have at least one endpoint in a solution. Hence, if we pick an edge 
uniformly at random, and then independently pick its random endpoint, with 
probability at least 1/4 we would pick a vertex from the solution. By iterating 
this process, we obtain an algorithm that solves the input FEEDBACK VERTEX 
SET instance in polynomial time, with probability at least 47". 


Theorem 5.2. There exists a polynomial-time randomized algorithm that, 
given a FEEDBACK VERTEX SET instance (G,k), either reports a failure or 
finds a feedback vertex set in G of size at most k. Moreover, if the algorithm 
is given a yes-instance, it returns a solution with probability at least 4—*. 


Proof. We describe the algorithm as a recursive procedure that, given a graph 
G and an integer k, aims at a feedback vertex set of G of size at most k. 

We first apply exhaustively Reductions [FVS.1}FVS.5|to the FEEDBACK 
VERTEX SET instance (G,k). If the reductions conclude that we are dealing 
with a no-instance, then we report a failure. Otherwise, let (G’,k’) be the 
reduced instance. Note that 0 < k’ < k and G” has minimum degree at least 
3. Let Xo be the set of vertices deleted due to Reduction [FVS.1] Note that 
the vertices of Xo have been qualified as mandatory vertices for a feedback 
vertex set in G, that is, there exists a feedback vertex set in G of minimum 
possible size that contains Xo, and for any feedback vertex set X’ of G’, 
X’ U Xo is a feedback vertex set of G. Moreover, |Xo| = k — k’. 

If G’ is an empty graph, then we return Xo; note that in this case |Xo| < k 
as k’ > 0, and Xo is a feedback vertex set of G. Otherwise, we pick an edge e of 
G” uniformly at random (i.e., each edge is chosen with probability 1/|E(G’)|), 
and choose one endpoint of e independently and uniformly at random. Let v 
be the chosen endpoint. We recurse on (G’ — v, k’ — 1). If the recursive step 
returns a failure, then we return a failure as well. If the recursive step returns 
a feedback vertex set X’, then we return X := X’ U {v} U Xo. 

Note that in the second case the set X’ is a feedback vertex set. of G” — v of 
size at most k’ — 1. First, observe that the size bound on X’ implies |X| < k. 
Second, we infer that X’U{v} is a feedback vertex set of G” and, consequently, 
X is a feedback vertex set of G. Hence, the algorithm always reports a failure 
or returns a feedback vertex set of G of size at most k. It remains to argue 
about the probability bound; we prove it by induction on k. 

Assume that there exists a feedback vertex set X of G of size at most 
k. By the analysis of the reduction rules of Section Reduction [FVS.5]is 


5.2 Color coding 103 


not triggered, the instance (G’,k’) is computed and there exists a feedback 
vertex set X’ of G’ of size at most k’. If G’ is empty, then the algorithm 
returns Xo deterministically. Otherwise, since G” has minimum degree at 
least 3, Lemma |5.1|implies that, with probability larger than 1/2, the edge e 
has at least one endpoint in X’. Consequently, with probability larger than 
1/4, the chosen vertex v belongs to X’, and X’ \ {v} is a feedback vertex 
set of size at most k’ — 1 in the graph G” — v. By the inductive hypothesis, 
the recursive call finds a feedback vertex set of G” — v of size at most k’ — 1 
(not necessarily the set X’ \ {v}) with probability at least 4~‘*'—). Hence, a 
feedback vertex set of G of size at most k is found with probability at least 
; -4-(k'-1) = 4-® > 4-*. This concludes the inductive step, and finishes the 
proof of the theorem. 

















As discussed at the beginning of this section, we can repeat the algorithm 
of Theorem [5.2|independently A* times to obtain a constant error probability. 


Corollary 5.3. There exists a randomized algorithm that, given a FEED- 
BACK VERTEX SET instance (G, k), in time 4*n° either reports a failure 
or finds a feedback vertex set in G of size at most k. Moreover, if the algorithm 
is given a yes-instance, it returns a solution with a constant probability. 


5.2 Color coding 


The technique of color coding was introduced by Alon, Yuster and Zwick to 
handle the problem of detecting a small subgraph in a large input graph. More 
formally, given a k-vertex “pattern” graph H and an n-vertex input graph G, 
the goal is to find a subgraph of G isomorphic to H. A brute-force approach 
solves this problem in time roughly O(n"); the color coding technique ap- 
proach allows us to obtain an FPT running time bound of 20)n°) in the 
case when H is a forest or, more generally, when H is of constant treewidth 
(for more on treewidth, we refer you to Chapter [7). We remark here that, as 
we discuss in Chapter [13] such an improvement is most likely not possible in 
general, as the case of H being a k-vertex clique is conjectured to be hard. 

The idea behind the color coding technique is to randomly color the entire 
graph with a set of colors with the number of colors chosen in a way that, 
if the smaller graph does exist in this graph as a subgraph, then with high 
probability it will be colored in a way that we can find it efficiently. In what 
follows we mostly focus on a simplest, but very instructive case of H being a 
k-vertex path. 

We remark that most algorithms based on the color coding technique can 
be derandomized using splitters and similar pseudorandom objects. We dis- 
cuss these methods in Section 


104 5 Randomized methods in parameterized algorithms 


5.2.1 A color coding algorithm for LONGEST PATH 


In the LONGEST PATH problem, we are given a graph G and a positive integer 
k as input, and the objective is to test whether there exists a simple path on 
k vertices in the graph G. Note that this corresponds to the aforementioned 
“pattern” graph search problem with H being a path on k vertices (henceforth 
called a k-path for brevity). 

Observe that finding a walk on k vertices in a directed graph is a simple 
task; the hardness of the LONGEST PATH problem lies in the requirement that 
we look for a simple path. A direct approach involves keeping track of the 
vertices already visited, requiring an (2) factor in the running time bound. 
The color coding technique is exactly the trick to avoid such a dependency. 


Color the vertices uniformly at random from {1,..., k}, and find a path 
on k vertices, if it exists, whose all colors are pairwise distinct. 


The essence of the color coding technique is the observation that, if we 
color some universe with k colors uniformly at random, then a given k-element 
subset is colored with distinct colors with sufficient probability. 


Lemma 5.4. Let U be a set of size n, and let X C U be a subset of size 
k. Let x : U — [|k] be a coloring of the elements of U, chosen uniformly at 
random (i.e., each element of U is colored with one of k colors uniformly and 
independently at random). Then the probability that the elements of X are 
colored with pairwise distinct colors is at least e~*. 


Proof. There are k” possible colorings y, and k!k"—" of them are injective 
on X. The lemma follows from the well-known inequality k! > (k/e)*. 














Hence, the color coding step reduces the case of finding a k-path in a graph 
to finding a colorful k-path in a vertex-colored graph. (In what follows, a path 
is called colorful if all vertices of the path are colored with pairwise distinct 
colors.) Observe that this step replaces the (2) factor, needed to keep track 
of the used vertices in a brute-force approach, with a much better 2* factor, 
needed to keep track of used colors. As the next lemma shows, in the case of 
finding a colorful path, the algorithm is relatively simple. 


Lemma 5.5. Let G be a directed or an undirected graph, and let x : V (G) > 
[k] be a coloring of its vertices with k colors. There exists a deterministic 
algorithm that checks in time 2'n°™ whether G contains a colorful path on 
k vertices and, if this is the case, returns one such path. 


Proof. Let Vi,...,V~ be a partitioning of V(G) such that all vertices in V; 
are colored i. We apply dynamic programming: for a nonempty subset S of 


{1,...,k} and a vertex u € Uj;eg Vi, we define the Boolean value PATH(S, u) 
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to be equal to true if there is a colorful path with vertices colored with all 
colors from S$ and with an endpoint in u. For |S| = 1, note that PATH(S, u) 
is true for any u € V(G) if and only if S = {x(u)}. For |S| > 1, the following 
recurrence holds: 


PATH(S, u) = es : wE E(G)} if x(u) eS 


False otherwise. 


Indeed, if there is a colorful path ending at u using all colors from S, then 
there has to be colorful path ending at a neighbor v of u and using all colors 
from S \ {x(w)}. 

Clearly, all values of PATH can be computed in time 2'n°™ by applying 
the above recurrence, and, moreover, there exists a colorful k-path in G if 
and only if PATH([k],v) is true for some vertex v € V(G). Furthermore, a 
colorful path can be retrieved using the standard technique of backlinks in 
dynamic programming. 














We now combine Lemmas and [5.5] to obtain the main result of this 


section. 


Theorem 5.6. There exists a randomized algorithm that, given a LONGEST 
PATH instance (G,k), in time (2e)*nO™ either reports a failure or finds a 
path on k vertices in G. Moreover, if the algorithm is given a yes-instance, 
it returns a solution with a constant probability. 


Proof. We show an algorithm that runs in time 2*n°“), and, given a yes- 
instance, returns a solution with probability at least e~". Clearly, by repeat- 
ing the algorithm independently e} times, we obtain the running time bound 
and success probability guarantee promised by the theorem statement. 

Given an input instance (G,k), we uniformly at random color the vertices 
of V(G) with colors [k]. That is, every vertex is colored independently with 
one color from the set [k] with uniform probability. Denote the obtained 
coloring by x : V(G) — [k]. We run the algorithm of Lemma [5.5] on the 
graph G with coloring y. If it returns a colorful path, then we return this 
path as a simple path in G. Otherwise, we report failure. 

Clearly, any path returned by the algorithm is a k-path in G. It remains 
to bound the probability of finding a path in the case (G, k) is a yes-instance. 

To this end, suppose G has a path P on k vertices. By Lemma [5.4] P 
becomes a colorful path in the coloring x with probability at least e~”. If this 
is the case, the algorithm of Lemma[5.5]finds a colorful path (not necessarily 
P itself), and the algorithm returns a k-path in Œ. This concludes the proof 
of the theorem. 
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5.3 Random separation 


In this section we give a variant of the color coding technique that is par- 
ticularly useful in designing parameterized algorithms on graphs of bounded 
degree. The technique, usually called random separation in the literature, 
boils down to a simple but fruitful observation that in some cases if we ran- 
domly partition the vertex or edge set of a graph into two sets, the solution 
we are looking for gets highlighted with good probability. 

We illustrate the method on the SUBGRAPH ISOMORPHISM problem, where 
we are given an n-vertex graph G and a k-vertex graph H, and the objective 
is to test whether there exists a subgraph H of G such that H is isomorphic 
to H. Observe that LONGEST PATH is a special case of SUBGRAPH ISOMOR- 
PHISM where H is a path on k vertices. Exercise [5.2] asks you to generalize 
the color coding approach for LONGEST PATH to TREE SUBGRAPH ISOMOR- 
PHISM, where H is restricted to being a tree, whereas Exercise [7.48] takes 
this generalization further, and assumes H has bounded treewidth (the no- 
tion of treewidth, which measures resemblance of a graph to a tree, is the 
topic of Chapter [7). Also, observe that the CLIQUE problem is a special case 
of SUBGRAPH ISOMORPHISM, where H is a clique on k vertices. It is be- 
lieved that CLIQUE is not FPT (see Chapter [13}, and, consequently, we do 
not expect that the general SUBGRAPH ISOMORPHISM problem is FPT when 
parameterized by k. 

In this section we restrict ourselves to graphs of bounded degree, and show 
that if the degree of G is bounded by d, then SUBGRAPH ISOMORPHISM can be 
solved in time f(d,k)n°“ for some computable function f. In other words, 
SUBGRAPH ISOMORPHISM is FPT when parameterized by both k and d. 


The idea of random separation is to color edges (vertices) randomly 
such that the edges (vertices) of the solution are colored with one color 
and the edges (vertices) that are adjacent to edges (vertices) of the 
solution subgraph get colored with a different color. That is, we separate 
the solution subgraph from its neighboring edges and vertices using a 
random coloring. 

In SUBGRAPH ISOMORPHISM, if we perform a successful coloring step, 
then every connected component of the pattern graph H corresponds 
to one connected component of the subgraph induced by the first color 
in the colored graph G. 


Let us now focus on the SUBGRAPH ISOMORPHISM problem. We first do 
a sanity test and check whether the maximum degree of H is at most d; if 
this is not the case, then we immediately report that the given instance is a 
no-instance. Let us color independently every edge of G in one of two colors, 
say red and blue (denoted by R and B), with probability ł each. Denote 
the obtained random coloring by x : E(G) —> {R, B}. Suppose that (G, H) 
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is a yes-instance. That is, there exists a subgraph H in G such that H is 
isomorphic to H. 

Let I’ denote the set of edges that are incident to some vertex of V(H), 
but do not belong to E(H). We say that a coloring x is successful if both the 
following two conditions hold: 

1. every edge of E(#) is colored red, that is, E(B) C x71} (R); and 
2. every edge of I is colored blue, that is, l C y~!(B). 


Observe that E(#) and T are disjoint. Thus, the two aforementioned condi- 
tions are independent. Furthermore, since every edge of E (i JUT is incident 
to a vertex of V(A), \V(H)| = |V(H)| = k, and the maximum degree of G 
is at most d, we obtain 


|E(H)| + |T] < d|V(H)| < dk. 
Consequently, the probability that x is successful is at least 


1 Ss 1 
QIE(A)|+/r| T 24k" 





Let Gr be the subgraph of G containing only those edges that have been 
colored red, that is, Gp = (V(G),x~1(R)). The core observation now is as 
follows: if x is successful, then Hisa subgraph of Gpr as well and, moreover, 
H consists of a number of connected components of Gp. 

For simplicity assume first that H is connected. Thus, if x is successful, 
then H is a connected component of Gr. Consequently, we can go over all 
connected components C of Gr with exactly k vertices and test if C is iso- 
morphic to H. Such a single test can be done by brute force in time k!kO™, or 
we can use a more involved algorithm for GRAPH ISOMORPHISM on bounded 
degree graphs that runs in time kO(4!°s® on k-vertex graphs of maximum 
degree d. 

Let us now consider the general case when H is not necessarily connected. 
Let Hı, H2,..., Hp be the connected components of H, and let C1, C2,...,C> 
be the connected components of Gr. We construct an auxiliary bipartite 
graph B(H, Gpr), where every vertex corresponds to a connected component 
H, or a connected component Cj, and H; is adjacent to C} if they have at most 
k vertices each and are isomorphic. Observe that the graph B(H, Gpr) can 
be constructed in time k!nOC™ or in time kO(4!°8%nO, depending on the 
GRAPH ISOMORPHISM algorithm we use, and the subgraph H isomorphic to 
H corresponds to a matching in B(H, Gpr) that saturates {H), Ho,..., Hp}. 
(Note that every component of B(H, Gpr) is a complete bipartite graph, so 
the matching can be found using a trivial algorithm.) Consequently, we can 
check if there is a subgraph of Gz isomorphic to H that consists of a number 
of connected components of Gig in time k!n°™ or in time kOl d) nO), 
This completes the description the algorithm. 
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We have proved that the probability that y is successful is at least 27%. 
Hence, to obtain a Monte Carlo algorithm with false negatives we repeat the 
above procedure 2% times, and obtain the following result: 


Theorem 5.7. There exist Monte Carlo algorithms with false negatives that 
solve the input SUBGRAPH ISOMORPHISM instance (G, H) in time 24% k!nOW 
and in time 24° kO 484) nO) Here, |V(G)| =n, |V(H)| = k, and the maz- 
imum degree of G is bounded by d. 


*5.4 A divide and color algorithm for LONGEST PATH 


We now improve the running time bound of the algorithm for LONGEST 
PATH, by using a different choice of coloring. The idea is inspired by the 
“divide and conquer” approach: one of the basic algorithmic techniques to 
design polynomial-time algorithms. In other words, we will see a technique 
that is an amalgamation of divide and conquer and color coding. Recall that 
for color coding we used k colors to make the subgraph we are seeking colorful. 
A natural question is: Do we always need k colors, or in some cases can we 
reduce the number of colors and hence the randomness used in the algorithm? 
Some problems can naturally be decomposed into disjoint pieces and solutions 
to individual pieces can be combined together to get the complete solution. 
The idea of divide and color is to use randomization to separate the pieces. 
For example, in the case of graph problems, we will randomly partition all 
vertices (or edges) of a graph into the left and the right side, and show that 
the structure we were looking for has been conveniently split between the 
sides. We solve the problem recursively on a graph induced on left and right 
sides separately and then combine them to get the structure we are searching 
for. 

Thus, the workflow of the algorithm is a bit different from the one in the 
previous color coding example. We perform the partition step at every node 
of the recursion tree, solving recursively the same problem in subinstances. 
The leaves of the recursion tree correspond to trivial instances with k = O(1), 
where no involved work is needed. On the other hand, the work needed to 
glue the information obtained from subinstances to obtain a solution for the 
current instance can be seen as a variant of dynamic programming. 


The basic idea of the divide and color technique for the LONGEST PATH 
problem is to randomly assign each vertex of the graph G to either a set 
L (left) or another set R (right) with equal probability and thus obtain 
a partitioning of the vertices of the input graph. By doing this, we hope 
that there is a k-path P such that the first [£] vertices of P are in L 
and the last |£| vertices of P are in R. Observe that for a fixed k-path 
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SIMPLE-RANDOMIZED-PaTuHs(X, £) 
Input: A subset X C V(G) and an integer 4,1 < £< k 
Output: Dx,e 


If £ = 1, then return Dx ov, v] = T for all v € X, L otherwise. 
Uniformly at random partition X into L and R. 
D, p£] TSIMPLE-RANDOMIZED-PATHS(L, EAD 

T 


D | ¢ :=SIMPLE-RANDOMIZED-PATHS(R, | £ J) 
R,|3] N 2 


GE PE OS N TEA 


Return Dx, := Dire Pa Days 


Fig. 5.1: A simple algorithm to find a path on k vertices 











this happens with probability exactly 2—". After the partitioning we 
recurse on the two induced graphs G[L] and G[R]. 


However, the naive implementation of this scheme gives a success proba- 
bility worse than 2~°*), and hence cannot be used to obtain an algorithm 
with running time 20%)n°™ and constant success probability. We need two 
additional ideas to make this scheme work. First, we need to define the recur- 
sion step in a way that it considers all possible pairs of endpoints for the path. 
We present first an algorithm implementing this idea. Then we improve the 
algorithm further by observing that we can do better than selecting a single 
random partition in each recursion step and then repeating the whole al- 
gorithm several times to increase the probability of success. Instead, in each 
recursive step, we create several random partitions and make several recursive 
calls. 

In order to combine the subpaths obtained from G[L] and G[R], we need 
more information than just one k-vertex path from each side. For a given 
subset X C V(G), a number £ € {1,...,k} and a pair of vertices u and v 
of X let Dx, [u,v] denote a Boolean value equal to true if and only if there 
exists an ¢-vertex path from u to v in G[X]. Note that for a fixed subset X, 
we can consider Dx as an |X| x |X| matrix. In what follows, we describe a 
procedure SIMPLE-RANDOMIZED-PATHS(X, £) that computes a matrix Dx. 
The relation between Dya and Dy is the following. If Dx elu, v] is true 
for some u,v € X, then so is Dx ¢[u,v]. The crux of the method will be in 
ensuring that if Dx [u,v] is true for some fixed u and v, then Dx elu, v] is 
also true with sufficiently high probability. Thus, we will get a one-sided error 
Monte Carlo algorithm. 

Given a partition (L, R) of X, an |L| x |L| matrix A, and an |R| x |R] 
matrix B, we define A ™ B as an |X| x |X| Boolean matrix D. For every pair 
of vertices u,v € X we have D[u, v] equal to true if and only if u € L, v € R 
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and there is an edge zy € E(G[X]) such that x € L, y € Rand both Alu, z] 
and Bly, v] are set to true. Then, (Dr f£] hI Dpr] )[u, v] is true if and only if 
there exists an -path P in G[X] from u to v, whose first [$] vertices belong 
to L and the remaining [4] vertices belong to R. In particular, (Dr rs D< 
Dre) )[us v] implies Dx ¢[u,v]. The main observation is that if Dx e[u, v] 
is true, then one can hope that the random choice of L and R partitions 
the vertices of the path P correctly, i.e., so that (Dr fe D< Dpr, gj)lu, 0] 
is also true with some sufficiently large probability. Thus, we compute the 
matrices Di fe and Dragy recursively by invoking SIMPLE-RANDOMIZED- 


Parus(L,[£]) and SIMPLE-RANDOMIZED-PATHS(R, | J), respectively, and 
then return Dx = D, rg] P< Dy Lgj For the base case in this recurrence, 


we take £ = 1 and compute Dx; = Dx, directly from the definition. 

Let us analyze the probability that, if Dx,eļu,v] is true for some u,v € 
V (G), then Dx elu, v] is also true. For fixed £, by pe denote the infinimum of 
this probability for all graphs G, all sets X C V(G), and all vertices u,v € X. 
If Z = 1, then Dx = Dx and, consequently, pı = 1. Otherwise, let P be 
any ¢-path in GLX] with endpoints u and v; we now analyze how the path P 
can be “detected” by the algorithm. Let x be the [$]-th vertex on P (counting 
from u), and let y be the successor of z on P. Moreover, let Pr be the subpath 
of P between u and z, inclusive, and let Pr be the subpath of P between y 
and v, inclusive. Note that, Py has [4] vertices, Pp has |£] vertices and, as 
é>1, lf) <[h1<e 

Observe that, with probability 27%, all vertices of Pr are assigned to L 
and all vertices of Pr are assigned to R. Moreover, if this is the case, then 
both Drt [u, x] and Drg [y, v] are true, with the paths Pr and Pr being 


the respective witnesses. Consequently, Dite [u, x] and Dris [y, v] are both 


true with probability at least PePe: Tf this is the case, Dy elu, v] is set 
to true by the definition of the > product. Hence, we obtain the following 
recursive bound: 

Pe 2 2 PrP (5.1) 


By solving the recurrence (5.1), we obtain pẹ = 2~°(l°s4_ To see this, 
observe that, in the search for the path P for the value Dx »[u,v], there 
are l — 1 partitions that the algorithm SIMPLE-RANDOMIZED-PATHS needs 
to find correctly: one partition of all @ vertices, two partitions of roughly 
é/2 vertices, four partitions of roughly ¢/4 vertices, etc. That is, for each 
i=0,1,..., [log | —1, the algorithm needs to make, roughly 2' times, a cor- 
rect partition of roughly k/2’ vertices. Consequently, the success probability 
of the algorithm is given by 


log £-1 1 9% 

a _ 9—O(Llog 2) 

Pe lI (saa) =2 ae 
i=0 
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FASTER-RANDOMIZED-PaTus(X, £) 
Input: A subset X C V(G) and an integer 4,1 < £< k 
Output: Dx,e 


1. If €=1, then return Dx ¢[v,v] = T for all v € X, L otherwise. 


2. Set Dx elu, v] to false for any u,v € X. 
3. Repeat the following 2° log(4k) times 


(a) Uniformly at random partition X into L and R. 
b)D, EA :=FasTER-RANDOMIZED-Patus(L, (41) 
LAN 
D 


( 
(c) DR L£] :=FASTER-RANDOMIZED-PATHS(R, L$) 
LD 
(d) Oomipute Eon Dye) mA Daisy and update 
Dx plu, v] := Dx e[u,v] V DX plu, v] for every u,v € V(G). 


4. Return Dx. 


Fig. 5.2: A faster algorithm to find a path on k-vertices 











Thus, to achieve a constant success probability, we need to run the algorithm 
SIMPLE-RANDOMIZED-PATHS(V(G), k) 20 108K) number of times, obtaining 
a significantly worse running time bound than the one of Theorem |5.6 

Let us now try to improve the running time of our algorithm. The main 
idea is that instead of repeating the entire process 20(*!s*) times, we split 
the repetitions between the recursive calls. More precisely, we choose some 
function f(@,&) and, in each recursive call, we try not only one random parti- 
tion of V (G) into L and R, but f(¢,k) partitions, each chosen independently 
at random. Recall that £ denotes the argument of the recursive call (the 
length of paths we are currently looking for), whereas k denotes the argu- 
ment of the root call to FASTER-RANDOMIZED-PATHS (the length of a path 
we are looking for in the entire algorithm). In this manner, we increase the 
running time of the algorithm, but at the same time we increase the success 
probability. Let us now analyze what function f(¢,k) guarantees constant 
success probability. 

Recall that a fixed partition of the vertices into L and R suits our needs 
for recursion (i.e., correctly partitions the vertex set of one fixed ¢-path) with 
probability 2~°. Hence, if we try f(¢,k) partitions chosen independently at 
random, the probability that none of the f(¢,k) partitions suits our needs is 


at most 
1 \ fe) 
oo 62) 


Fix a k-vertex path P witnessing that Dyg),,[u,v] is true for some fixed 
u,v € V(G). The main observation is that, although the entire recursion tree 
is huge — as we shall later see, it will be of size roughly 4* — we do not need 
to be successful in all nodes of the recursion tree. First, we need to guess 
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correctly at least one partition in the root node of the recursion tree. Then, 
we may limit ourselves to the two recursive calls made for that one particular 
correct partition at the root node, and insist that in these two recursive calls 
we guess at least one correct partition of the corresponding subpaths of length 
[k/2] and |k/2]. If we proceed with such a reasoning up to the leaves of the 
recursion, we infer that we need to guess correctly at least one partition at 
exactly k — 1 nodes. Consequently, we obtain constant success probability if 
the value of the expression (5.2) is at most HEL for every 1 < £ < k. To 
achieve this bound, observe that we can choose f(é,k) = 2° log(4k). Indeed, 
since 1+ a < e” for every real z, 


A 1 2° log(4k) P -a 2° log(4k) 1 y 1 
T 2 = Tosldk) op 
z = (e ) elog(4k) — 2k` 


Since the probability space in the algorithm is quite complex, some readers 
may find the argumentation in the previous paragraph too informal. For sake 
of completeness, we provide a formal argumentation in the next lemma. 


Lemma 5.8. If (G,k) is a yes-instance and f(L, k) = 2°log(4k) (i.e., the 
value of the expression is at most Ca for every 1 < £ < k), then 
with probability at least 5 there exists a pair of vertices u,v € V(G) with 
Dye) x(t, v] equal to true. 


Proof. Consider a set X C V(G), vertices u,v € V(G), an integer 1 < £ < k, 
and an é-vertex path P in G[X] with endpoints u and v. Clearly, P witnesses 
that Dx [u,v] = 1. Let pf denote the infinimum, for fixed k and £, over 
all choices of G, X, u, and v such that Dx [u,v] = 1, of the probability 
that a call to FASTER-RANDOMIZED-PATHS(X, £) on instance (G, k) returns 
Dx elu, v] = 1. Let s = TD We prove by induction on £ that for every 
1 << k we have p} > 1— (l — 1)s; note that the lemma follows from such 
a claim for £ = k. In the base case, observe that pk =las Dxi = Dx, for 
every X C V(G). 

For the induction step, consider a path P in G[X], with £ > 1 vertices 
and endpoints u and v. Observe that at the call FASTER-RANDOMIZED- 
PATHS(X, £) to set Dx e[u,v] = 1 it suffices to (a) at least once guess a 
correct partition for the path P; and (b) for the first such correct partition 
(L, R) compute Dr feja) [u,z] = 1 and Dre? [y,v] = 1, where x is the 
[£]-th vertex on P (counting from u), and y is the successor of x on P. 
Consequently, 








( 
> (1—s)-(1— ([£/2] — 1)s)- @ = ([4/2] = 1)s) 
> 1—s— ([é/2] — 1)s — ([¢/2] — 1)s 

1 
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Here, the second inequality follows from the inductive assumption, as £ > 1 
implies |¢/2| < [@/2] < £. 














For ease of presentation, the modified algorithm with improved error prob- 
ability is given on Fig. 

Let us now analyze the running time of the algorithm: due to the choice 
of f(é,k) = 2°log(4k), it is no longer polynomial. We have the following 
recursive formula: 


T(n, £, k) < 2log(4k) (7 (r š] £) +7 (r 5 .t)) +n®., (5.3) 


This solves to T(n, £, k) = 46+°(+*)n O°, and we obtain the following im- 
provement upon Theorem |5.6 


Theorem 5.9. There exists a randomized algorithm that, given a LONGEST 
PATH instance (G, k), in time Ak+o(k)nO either reports a failure or finds a 
path on k vertices in G. Moreover, if the algorithm is given a yes-instance, 
it returns a solution with a constant probability. 


5.5 A chromatic coding algorithm for d-CLUSTERING 


We now move our attention to edge modification problems, where we are to 
modify the input graph G by adding or deleting a small number of edges to 
obtain a desired property (i.e., make a graph belong to a prescribed graph 
class). The idea of chromatic coding (or color and conquer, as it is sometimes 
referred to in the literature) is to randomly color the vertices of the input 
graph, with the hope that for every edge uv of the solution, i.e., the set of 
edges that have to be added/deleted, the colors assigned to u and v will 
differ. Once this random coloring step is successfully completed, the problem 
instance seems to have a nice structure which can be exploited for algorithmic 
applications. In particular, after a successful coloring, we can observe the 
following. 


1. All the solution edges go across color classes. 

2. Therefore, no modification happens within a color class, hence the sub- 
graph induced by each color class is already a member of the target graph 
class. 


While in some cases such properties significantly help us in resolving 
the problem at hand, the crux of the chromatic coding approach is that, 
in a yes-instance where the number of modifications in the solution is 
bounded by k, the number of colors that guarantee the desired property 
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with good probability is sublinear in k. Consequently, the algorithms 
obtained via chromatic coding are usually subexponential, i.e., they run 
in 2°) nO time. 


As we discuss in Chapter[I4] subexponential parameterized algorithms are 
relatively rare. 


Probability of a good coloring. Before we dive into a specific exemplary 
problem for chromatic coding, let us formally analyze the random coloring 
step — as it is generic for all algorithms using this approach. For a graph 
G, we say that a coloring x : V(G) — [|q] properly colors E(G) if for every 
uv € E(G) we have x(u) 4 x(v). 


Lemma 5.10. If the vertices of a simple graph G on k edges are colored in- 
dependently and uniformly at random with [V8k] colors, then the probability 


that E(G) is properly colored is at least 2~ V ne 


Proof. Let n = |V(G)|. Consider the sequence of vertices generated by the 
following procedure. 


1. Initialize Go = G. 

2. For every i = 1,2,...,n, repeat the following procedure. Let v; be a 
vertex of minimum possible degree in G;_1. Remove v; from G;_, to 
obtain the graph G; (i.e., define G; := G;_-1 — vi). 


Observe that G, is empty. 
Let d; be the degree of v; in Gi—1. Since we are dealing with simple graphs, 
for every i = 1,...,n we have 


di < |V(Gi-1)| — 1. (5.4) 
Moreover, since v; is of minimum possible degree in Gi—1, we have 
2k = 2|E(G)| > 2|E(Gi-1)| > di - |V (Gi—1)l. (5.5) 


By merging and (5.5) we obtain 2k > d?, that is, d; < V2k. 

Let us now consider the process of the random choice of the coloring x 
by iteratively coloring the vertices in the order Un, Un—1,...,U1; each vertex 
is colored independently and uniformly at random with one of q := [V8k] 
colors. Let P; denote the event when the edge set E(G;) is properly colored. 
Clearly, Pr(P,,) = 1 and Pj C P; for j < i. Let us see what happens when we 
color the vertex vj, 1 < i < n. There are d; edges connecting v; with V (G4), 
thus at most d; possible values of x(v;) would cause some edge v,v;, j >i, 
to be incorrectly colored (i.e., x(vi) = x(v;)). Consequently, we obtain 
q— dj aJa di 
q q 





Pr(P;—1 | P;) > > 92da, (5.6) 
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where the last inequality follows from the estimate 1— x > 27? for 0 < x < i 
(recall that d; < V2k, that is, 7 < 5). 

Observe that Pp is the event when the edge set E(Go) = E(G) is properly 
colored. Moreover, 5~;_, di = k, as every edge is counted exactly once in the 
sum De di. By applying P;—ı C P; and for every 1 < i < n we obtain: 


Pr(Po) = Pr(Po | P,) Pr(P1) 
= Pr(Po | P,) Pr(P; | P>) Pr(P2) 
= Pr(P,) JPP- | P;) 


> II g-2di/4 = 9-2 ia di/q 
{=l 


— 9—2 k/[V8k] > g= vy k/2. 














In chromatic coding, 


e we use O(Vk) colors, and 
e the success probability is Q-O(Vk) (Lemma 5.10). 


Therefore, if the running time of the algorithm is single-exponential in 
the number of colors, i.e., for c colors it runs in time 20} nO, then 
20(vk) repetitions give a 2O(Vk) nO)_time algorithm with constant er- 
ror probability. Contrast this with color coding, where 


e we use O(k) colors, and 
e the success probability is 2-0). 


Thus if the running time of the algorithm is single-exponential in the 
number of colors, then 2°) repetitions give a 20()nO™-time algo- 
rithm with constant error probability. 


The d-CLUSTERING problem. Armed with Lemma |5.10| we exemplify the 
chromatic coding approach on the d-CLUSTERING problem. 

A graph H is called an ¢-cluster graph if H has £ connected components, 
and every connected component of H is a complete graph (clique). A graph 
H is a cluster graph if it is an &cluster graph for some £. For a graph G, 
by adjacencies in G we mean pairs in O. For a subset A C O, 
by modifying the adjacencies A in G we mean an operation G @ A that 
creates a graph with vertex set V(G @ A) = V(G) and edge set E(G @ A) = 
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(E(G) \ A) U (A \ E(G)). In other words, for any uv € A, if uv € E(G) then 
we delete uv from G, and otherwise we add wv to G. In the d-CLUSTERING 
problem, we are given an undirected graph G and a nonnegative integer k, 
and we ask whether there exists a set A of at most k adjacencies in G such 
that G @ A is a d-cluster graph. Such a set A is henceforth called a solution 
to the instance (G,k). We remark here that the integer d is a part of the 
problem definition, and is considered constant through the algorithm. 

In the first step of our algorithm, given an instance (G, k) of d- CLUSTERING, 
we pick q := [8k] and randomly color the vertices of G with q colors. Let 
x : V(G) > [q] be the obtained coloring. We say that a set of adjacencies 
AC Coe is properly colored if x properly colors the graph (V(G), A). 
Lemma ensures that, if (G,k) is a yes-instance and A is a solution to 
(G,k), A is properly colored with sufficiently high probability. Our goal now 
is to show an algorithm that efficiently seeks for a properly colored solution. 


Solving a colored instance. Assume we are given a d-CLUSTERING in- 
stance (G, k) and a coloring x. We start with the following simple observa- 
tion. 


Lemma 5.11. Let G be a graph and x : V(G) — |q] be a coloring function. 
Furthermore, let V, denote the vertices of G that are colored i. If there exists 
a solution A in G that is properly colored by x, then for every V;, G[V;] is an 
l-cluster graph for somel<d. 


Proof. Since every edge in A has endpoints of different colors, each of the 
subgraphs G/[V;] is an induced subgraph of G@ A. But G@ A is a d-cluster 
graph, and every induced subgraph of a cluster graph is also a cluster graph. 
Hence, G[V;] is a cluster graph with at most d components. 














We now proceed to the description of the algorithm. Suppose the given in- 
stance has a properly colored solution A. Lemma [5.11] implies that G[V;] is 
an ¢-cluster graph for some Z < d and it remains so in G @ A. Therefore, 
every component of G[V;] is a clique and it lies in one of the d components of 
the graph G @ A; for each such component let us guess where it lies in G@ A. 
Since the total number of components of the subgraphs G[Vj] is bounded by 
dq = d[V8k], we have at most dalk] — 20(dlogdvk) choices. Finally, it 
simply remains to check if the number of edges to be added and deleted to 
be consistent with the guess is at most k. All we need to do is to add to A 
any pair of adjacent vertices that are guessed to be in different components 
of G@ A and any pair of nonadjacent vertices that are guessed to be in the 
same component of G @ A. Hence, we obtain the following. 


Lemma 5.12. Given an instance (G,k) of d-CLUSTERING with a coloring 
x: V(G) > [[V8k]], we can test if there is a properly colored solution in 
time 20 (dlog dVk) pOl), 


Using a simple dynamic-programming approach one can show the following 


(see Exercise [5.7p. 
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Lemma 5.13. Given an instance (G,k) of d-CLUSTERING with a coloring 
xX: V(G) > [[V8k]], we can test if there is a properly colored solution in 
time 20(4Vk) pO), 


Combining Lemmas [5.10|and [5.13] we get that there is a randomized FPT 
algorithm for d-CLUSTERING running in time 2O(4Vk)n OW with a success 
probability of at least Q-O(Wk) Thus, repeating the above algorithm inde- 
pendently 20(Vk) times we obtain a constant success probability. This leads 
to the following theorem. 


Theorem 5.14. There exists a randomized algorithm that, given an instance 
(G,k) of d-CLUSTERING, in time QO(dVk)nO) either reports a failure or 
finds a solution to (G,k). Moreover, if the algorithm is given a yes-instance, 
it returns a solution with a constant probability. 


5.6 Derandomization 


It turns out that all presented algorithms based on some variant of the color 
coding technique can be efficiently derandomized. The goal of this section is 
to show basic tools for such derandomization. 


The basic idea of derandomization is as follows: instead of picking a 
random coloring x : [n] — [k], we deterministically construct a family 
F of functions f : [n] > [k] such that it is guaranteed that one of the 
functions from F has the property that we hope to attain by choosing 
a random coloring x. 


Quite surprisingly, it turns out that such families F of small cardinal- 
ity exist and can be constructed efficiently. In particular, if we estimated 
the probability that a random coloring x satisfies the desired property by 
1/f(k), the size of the corresponding family F is usually not much larger 
than f(k)logn. Consequently, we are able to derandomize most of the algo- 
rithms based on variants of the color coding technique with only small loss 
of efficiency. 

We limit ourselves only to formulating the appropriate results on construc- 
tions of pseudorandom objects we need, and showing how to derandomize the 
algorithms presented earlier in this section. The actual methods behind the 
constructions, although very interesting, are beyond the scope of this book. 
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5.6.1 Basic pseudorandom objects 


The basic pseudorandom object we need is called a splitter. 


Definition 5.15. An (n, k, ¢)-splitter F is a family of functions from [|n] to 
[£] such that for every set S C [n] of size k there exists a function f € F that 
splits S evenly. That is, for every 1 < j, j’ < £, |f-1() N S| and |f) AS] 
differ by at most 1. 


One can view a function f in a splitter F as a coloring of [n] into £ colors; 
the set S is split evenly if f uses on S each color roughly the same number 
of times. Note that for @ > k, the notion of “splitting S evenly” reduces to f 
being injective on S. Indeed, if |f~!(j) N S| > 2 for some j, then for every 
j’ we have |f~1(j’)M.$| > 1 and hence |S] > |H) N S| > 4+1 > |S, a 
contradiction. 

Our basic building block for derandomization is the following construction. 


Theorem 5.16 ([15]). For any n,k > 1 one can construct an (n,k,k?)- 
splitter of size kKO logn in time KOM nlogn. 


We emphasize here that the size of the splitter provided by Theorem 
is very small: it is polynomial in k, and depends only logarithmically on n 
(Exercise [5.18] asks you to show that this dependence on n is optimal.) This 
is achieved by allowing the members of a splitter to have a relatively large 
codomain, namely of size k?. 

However, observe that the case of a much smaller codomain, namely £ = k, 
directly corresponds to the color coding algorithm for LONGEST PATH: we 
were coloring the vertex set of a graph G with k colors, hoping to be injective 
on the vertex set of one fixed k-path in G. This special case of a splitter is 
known as a perfect hash family. 


Definition 5.17. An (n, k, k)-splitter is called an (n,k)-perfect hash family. 


To obtain a perfect hash family, we need to allow the splitter to be signif- 
icantly larger than in Theorem 


Theorem 5.18 ([372]). For any n,k > 1 one can construct an (n, k)-perfect 
hash family of size e*kO 8") logn in time e®kO "8 *) n logn. 


The proof of Theorem|5.18]is derived by a composition of an (n, k, k”)-splitter 
Fı obtained from Theorem with an explicit construction of a small 
(k?, k, k)-splitter Fo, i.e., the resulting (n, k)-perfect hash family is F = { f20 
fi : fi € Fi, fo € Fo}. (We denote by fzo fı the composition of the two 
functions obtained by performing fı first and then fo, that is, (foo fi)(a) = 
fol fule))) 

Observe that the actual size bound of Theorem [5.18] is very close to the 
inverse of the probability that a random function [n] — [k] is injective on a 
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given set S C [n] of size k (Lemma|5.4). Hence, if we replace repeating a color 
coding algorithm e* times with iterating over all elements of an (n, k)-perfect 
hash family of Theorem [5.18] we obtain a deterministic algorithm with only 
a very slight increase in the running time bound. 

For the random separation and divide and color algorithms, recall that we 
needed to capture a specific partition either of the set l U E(#), or of the 
vertex set of a path P in question. Hence, we need a notion that is able to 
capture all partitions of a given small subset of the universe. 


Definition 5.19. An (n,k)-universal set is a family U of subsets of [n] such 
that for any S C [n] of size k, the family {AN S : A € U} contains all 2* 
subsets of S. 


Similar to the case of perfect hash families, it is possible to obtain a uni- 
versal set of size very close to the inverse of the probability that a random 
subset, of [n] has a desired intersection with a fixed set S C [n] of size k. 


Theorem 5.20 ([372]). For any n,k > 1 one can construct an (n,k)- 
universal set of size 2*kO0°8*) logn in time 2*kO0°8")n logn. 


As in the case of Theorem the proof of Theorem [5.20] boils down to 
an explicit construction of a (k?,k)-universal set and composing it with a 
splitter of Theorem 

Finally, for the chromatic coding approach, we need to revise the coloring 
step and observe that in fact we do not need the colors of vertices to be 
independent in general; limited independence is sufficient. This brings us to 
the following definition. 


Definition 5.21 (k-wise independent). A family H,,,4,q of functions from 
[n] to [q] is called a k-wise independent sample space if, for every k positions 
1<t, <i <: < ip <n, and every tuple a € [q], we have 


Pr((f(é1), fiz), e.g f(tx)) = a) = q} 
where the function f € Hn, k, is chosen uniformly at random. 


Fortunately, there are known constructions of k-wise independent sample 
spaces of small size. 


Theorem 5.22 ([L0]). There exists a k-wise independent sample space Hn,k,q 
of size O(n*) and it can be constructed efficiently in time linear in the output 
size. 


In the next section we will see that to derandomize chromatic coding it 
suffices to modify the 2-independent sample space from Theorem 
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5.6.2 Derandomization of algorithms based on variants 
of color coding 


We now show how to derandomize the algorithms from Section [5.2] using the 
pseudorandom objects of Section [5.6.1] For the sake of completeness, we first 
repeat the more or less straightforward arguments for color coding and divide 
and conquer. Then, we show a more involved argument for chromatic coding. 


Color coding and LONGEST PATH. Let (G,k) be the input instance for 
LONGEST PATH, where n = |V(G)|. Instead of taking a random coloring x 
of V(G), we use Theorem [5.18] to construct an (n,k)-perfect hash family F. 
Then, for each f € F, we proceed as before, that is, we invoke the dynamic- 
programming algorithm of Lemma|5.5}for the coloring x := f. The properties 
of an (n,k)-perfect hash family F ensure that, if there exists a k-path P in 
G, then there exists f € F that is injective on V(P) and, consequently, the 
algorithm of Lemma [5.5] finds a colorful path for the coloring x := f. Hence, 
we obtain the following deterministic algorithm. 


Theorem 5.23. LONGEST PATH can be solved in time (2e)*kO°8) nO by 
a deterministic algorithm. 


We emphasize here that the almost-optimal size bound of Theorem 
makes the running time of the algorithm of Theorem only slightly worse 
than that of Theorem 


Divide and color and LONGEST PATH. Let (G,k) again be the input in- 
stance for LONGEST PATH, where n = |V(G)|. We derandomize the proce- 
dure FASTER-RANDOMIZED-PATHS(G, k) of Section [5.4] as follows: instead 
of taking f(k) random partitions of V(G), we use Theorem [5.20] to compute 
an (n, k)-universal set U and recurse on all partitions (L := A, R := V(G)\ A) 
for A € U. By the properties of a universal set, for every u,v € V(G) and 
every k-path P from u to v in G, there exists A € U where AN V (P) consists 
of the first [4] vertices of P. Consequently, there always exists at least one 
recursive step where the path P is properly partitioned, and the algorithm 
actually computes Dvo) = Dy(G),k- 

As for the running time bound, observe that the size bound of U promised 
by Theorem [5.20] is very close to the choice f(k) = 2* log(4k) we made in 
Section Consequently, if we solve the appropriate variant of recursive 
formula (5.3) we obtain an algorithm with almost no increase in the running 
time (in fact, all increase is hidden in the big-O notation). 


Theorem 5.24. LONGEST PATH can be solved in time 4'+°!) nO by a 
deterministic algorithm. 


The derandomization of the random separation algorithm for SUBGRAPH 
ISOMORPHISM in graphs of bounded degree, presented in Section is 
straightforward and postponed to Exercise 
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5.6.2.1 A derandomized chromatic coding algorithm for 
d-CLUSTERING 


Let (G,k) be the input instance for d-CLUSTERING, where n = |V(G)|. Fol- 
lowing previous examples, we would like to replace the random choice of a 
coloring of V(G) into q = O(Vk) colors with an iteration over some pseu- 
dorandom object. Recall that in Section the essential property we were 
hoping for was that the obtained coloring x properly colors the solution A. 
This leads to the following definition: 


Definition 5.25. A (n,k,q)-coloring family is a family F of functions from 
[n] to [q] with the following property: for every graph G on the vertex set 
[n] with at most k edges, there exists a function f € F that properly colors 
E(G). 


Thus, to derandomize the algorithm of Theorem we need to provide 
an efficient way of constructing a small (n, k, q)-coloring family for some q = 
O(Vk). The basic idea is that a 2-wise independent sample space An k,q 
should be close to our needs for q = cVk and sufficiently large constant c. 

As in the case of construction of an (n, k)-perfect hash family or an (n, k)- 
universal set, it suffices to focus on the case n = k?; the general case can be 
resolved by composing with an (n, k, k?)-splitter of Theorem [5.16] Hence, we 
start with the following explicit construction. 


Lemma 5.26. For any k > 1, there exists a (k?,k,2[/Vk])-coloring family F 
of size QO(Vklogk) that can be constructed in time linear in its size. 


Proof. Denote q = [Vk]. We use Theorem [5.29] to obtain a 2-wise indepen- 
dent sample space G := Hy2,9,q. Note that the size of G is bounded by O(k*). 
Recall that every element g € G is a function g : [k?] > [q]. 

We can now describe the required family F. For each g € G and each 
subset T C [k?] of size |T| = q, we define a function fyr € F as follows. 
Suppose T = {t1,%2,..., iq}, with i1 < ig <...< iq. For 1 < j < q, we define 
fi) =q + j, and f(i) = g(i) if i ¢ T. Note that f(i) € [2q] for any i € [k?], 
and the size of F is at most 


"A ig] — 90(Vk log k) 
q 


To complete the proof we have to show that for every graph G on the set 
of vertices |k?] with at most k edges, there is an f € F that properly colors 
E(G). Fix such a graph G. 

We use the probabilistic method, i.e., we choose T and g in the definition 
of the function fọ r in a random way, so that f,r provides the required 
coloring for G with positive probability, which implies the existence of the 
desired function in F. The idea can be sketched as follows. The function g 
is chosen at random in G, and is used to properly color all but at most q 
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edges of E(G). The set T is chosen to contain at least one endpoint of each 
of these edges, and the vertices in the set T will be re-colored by a unique 
color that is used only once by fg,r. Then surely any edge incident to T will 
have different colors on its endpoints. Using the properties of G we now prove 
that with positive probability the number of edges uv € E(G) for which 
g(u) = g(v) (henceforth called monochromatic edges) is bounded by Vk. 


Claim 5.27. If the vertices of G are colored by a function g chosen at random 
from G, then the expected number of monochromatic edges is at most k/q < 


Vk. 


Proof. Fix an edge e in the graph G and j € [q]. As g maps the vertices in 
a pairwise independent manner, the probability that both the endpoints of 
e get mapped to 7 is precisely zr There are q possibilities for 7 and hence 


the probability that e is monochromatic is +. Let X be the random variable 
denoting the number of monochromatic edges. By linearity of expectation, 
the expected value of X is at most k- E < vk. 4 


Returning to the proof of the lemma, observe that by the above claim, with 
positive probability the number of monochromatic edges is upper bounded 
by q = [Vk]. Fix a g € G for which this holds and let T be a set of q vertices 
containing at least one endpoint of every monochromatic edge. Consider the 
function f,,r. As mentioned above, fọ r colors each of the vertices in T by 
a unique color, which is used only once by fy,r, and hence we only need to 
consider the coloring fg,r restricted to G \ T. However all edges of G \ T are 
properly colored by g and fg r coincides with g on G\T. Hence f,:r properly 
colors E(G), completing the proof of the lemma. 














Now we are ready to state the main result of this section. 


Theorem 5.28. For any n,k > 1, there exists an (n, k,2[Wk])-coloring fam- 
ily F of size 20 (Vk log k) log n that can be constructed in time 20(Vk log k) logn. 


Proof. First, use Theorem 5.16] to obtain an (n, k, k?)-splitter F,. Second, 
use Lemma|5.26|to obtain a (k?, k, [Vk])-coloring family Fo. Finally, define 
F :={f20 fı : fi € Fi, f2 € Fo}. A direct check shows that F satisfies all 
the desired properties. 














As announced at the beginning of this section, if we replace the random 
choice of a coloring in Theorem [5.14] with an iteration over the coloring fam- 
ily given by Theorem we obtain a deterministic subexponential time 
algorithm for d-CLUSTERING. 


Theorem 5.29. d-CLUSTERING can be solved in time 20 (VK (44108 k)) nO), 


Note that for d = O(1), the derandomization step resulted in an additional 
log k factor in the exponent, as compared to Theorem 
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Exercises 


5.1. In the TRIANGLE PACKING problem, we are given an undirected graph G and a positive 
integer k, and the objective is to test whether G has k-vertex disjoint triangles. Using color 
coding show that the problem admits an algorithm with running time 20()n°@), 


5.2. In the TREE SUBGRAPH ISOMORPHISM, we are given an undirected graph G and a 
tree T on k vertices, and the objective is to decide whether there exists a subgraph in G 
that is isomorphic to T. Obtain a 20(*)nO-time algorithm for the problem using color 
coding. 


5.3. Consider the following problem: given an undirected graph G and positive integers k 
and q, find a set X of at most k vertices such that G \ X has at least two components of 
size at least q. Show that this problem can be solved in time 20(49+*)n0(), 


5.4 (2). Assuming q > k, solve the problem from the previous problem in randomized 
time qe) nO, Can you derandomize your algorithm without any significant loss in the 
running time? 


5.5 (2). Show formally the bound pe = 27°(!°s% in the analysis of the procedure 
SIMPLE-RANDOMIZED-PatHs(X, £) in Section That is, show that for sufficiently large 


c, pe = 27% 108. satisfies (5.1). 
5.6 (@). Solve formally the recurrence (5.3). 
5.7. Prove Lemma|5.13 


5.8. Give a randomized FPT algorithm for the problem of deciding whether a given undi- 
rected graph contains a cycle of length at least k. Your algorithm should have running 
time ckn©°™., Note that a graph may not have any cycle of length exactly k, but contain 
a much longer cycle. Derandomize your algorithm using perfect hash families. 


5.9 (#2). Give a randomized FPT algorithm for the problem of deciding whether a given 
directed graph contains a cycle of length at least k. Your algorithm should use color coding 
with k? colors, and have running time 20(k*)n (1), Then improve the running time to 
kO(k)nOQ) by only using colors {1,...,k+ 1}, and assigning color k + 1 with probability 
1 — jy. Finally, derandomize your algorithm using (n, k?, k*)-splitters. 


5.10. In the SET SPLITTING problem, we are given a family of sets F over a universe U 
and a positive integer k, and the goal is to test whether there exists a coloring of U with 
two colors such that at least k sets in F are not monochromatic (that is, they contain 
vertices of both colors). 


1. Obtain a randomized FPT algorithm with running time 2*(\U| + |F|)OM. 
2. Using universal sets derandomize your algorithm and obtain a running time bound 
4k (U| + [FI)OM. 


5.11. In the PARTIAL VERTEX COVER problem, we are given an undirected graph G and 
positive integers k and t, and the goal is to check whether there exists a set X C V(G) 
of size at most k such that at least t edges of G are incident to vertices on X. Obtain an 
algorithm with running time 20/n©Q@) for the problem. (We show in Chapter [13] that 
this problem is W[1]-hard parameterized by the solution size k.) 


5.12. In the PARTIAL DOMINATING SET problem, we are given an undirected graph G and 
positive integers k and t, and the goal is to check whether there exists a set X C V(G) 
of size at most k such that |NG[X]| > t (that is, X dominates at least t vertices). Obtain 
an algorithm with running time 20()n°@) for the problem. (We show in Chapter[13]that 
this problem is W[1]-hard parameterized by the solution size k.) 
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5.13. A graph H is called r-colorable if there exists a function x : V(H) — [r] such that 
x(u) 4 x(v) for every uv € E(H). Consider the following problem: given a perfect graph G 
and integers k and r, check whether G admits an r-colorable induced subgraph on at least 
k vertices. Show an algorithm for this problem with running time f(k,r)n©“™). You could 
use the fact that we can find a maximum independent set in perfect graphs in polynomial 
time. 


5.14. In the PSEUDO ACHROMATIC NuMBER problem, we are given an undirected graph G 
and a positive integer k, and the goal is to check whether the vertices of G can be partitioned 
into k groups such that each pair of groups is connected by at least one edge. Obtain a ran- 
domized algorithm for PSEUDO ACHROMATIC NUMBER running in time 20(k? log k) ,O(2), 


5.15. Consider a slightly different approach for SUBGRAPH ISOMORPHISM on graphs of 
bounded degree, where we randomly color vertices (instead of edges) with two colors. 


1. Show that this approach leads to a 2°¢+)*kIn©@)-time Monte Carlo algorithm with 
false negatives. 
2. Improve the dependency on d in the running time of the algorithm to dO) InP), 


5.16. In the SPLIT EDGE DELETION problem, we are given an undirected graph G and a 
positive integer k, and the objective is to test whether there exists a set S C E(G) of size 
at most k such that G\ S is a split graph. Using chromatic coding show that the problem 
admits a 20(V# log k) 2 O(1)-time algorithm. 


5.17 (&). Show an algorithm for the SPLIT Enae DELETION problem (defined in the 
previous exercise) with running time 20(V') n00), 


5.18 (Z). Show that, for every n, > 2, any (n,2,£)-splitter needs to contain at least 
log, n elements. In other words, show that the logn dependency in the size bound of 


Theorem is optimal. 


5.19 (&). Give a deterministic version of Theorem|[5.7| using (n, k)-universal sets. 


5.20 (Z). Give a deterministic version of the first algorithm developed in Exercise 
Your algorithm should run in time 204+) *+0(4k) pn OQ), 


5.21 (%). Give a deterministic version of the second algorithm developed in Exercise|5.15 
Your algorithm should run in time 20(4(eg d+log k))pO(1), 


5.22 (Z). Show formally the running time bound of Theorem That is, formulate 
and solve the corresponding variant of recurrence (5-3). 


5.23 (Z). Complete the proof of Theorem verify that the size of the obtained family 
F satisfies the promised bound, and that F is in fact a (n, k, 2[/Wk])-coloring family. 


Hints 


[5-2] The coloring step is exactly the same as in the Lonarst PATH example: color the 
vertices of G with k colors, hoping to make the subgraph in question colorful; let V; be 
the set of vertices colored i. Then, design a dynamic-programming algorithm to find a 
colorful subgraph isomorphic to T. To this end, root T at an arbitrary vertex and, for 
every x € V(T), denote by Tx the subtree of T rooted at x. By the dynamic-programming 
approach, for every x € V(T), for every v € V(G), and for every set S of exactly |V (Tz)|—1 
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colors, check whether there exists a colorful subgraph of G|U;es Vi U {v}] isomorphic to 
Tx, where the vertex v corresponds to the root x. 


Randomly partition V(G) into two parts and hope that the vertices of the solution 
X will be placed on the left, while a q-vertex connected subgraph of each of the two large 
components of G \ X is placed on the right. 


Use the same approach as in the previous problem, but play with probabilities: assign 
each vertex to the left with probability 1/q, and to the right with the remaining probability. 
To derandomize the algorithm, use a (n, 2q + k, (2q + k)?)-splitter. 


Prove the following observation: if a graph G contains a cycle of length at least 2k, 
then, after contracting an arbitrary edge, it still contains a cycle of length at least k. Use 
this, together with color coding, to get the algorithm. 


Note that contracting a single edge may create a directed cycle even if there was none 
before contraction. Prove that contracting all of the edges of a directed cycle C cannot 
turn a no-instance into a yes-instance. Prove that if G has a cycle of length at least t before 
contracting C, then G has a cycle of length at least t/|C| after contracting C. Use this, 
together with color coding, to get the algorithm. 

To improve the running time from 2©(**) to kO(*) observe that if G contains a cycle 
C such that the k first vertices of C are colored 1,2,...,k, and the remaining vertices are 
colored k + 1, then we may find a cycle of length at least k in G in polynomial time. 


5.10| Fix a solution f : U — {1,2} and k sets Fi, Fo,...,Fe E€ F that are not 
monochromatic in the coloring f. For each set F;, fix two elements a;,1, a;,2 E€ F; such 
that f(ai1) A f(ai,2). Denote A := UŽ {ai,1, 04,2}. Prove that 2(2/4|-*) colorings 
g: A > {1,2} satisfy g(ai1) Æ g(ai,2) for every 1< i < k. 


Perform the color coding step with t colors on the set E(G). Then, a colorful solution 
is a set of at most k vertices that is adjacent to at least one edge of every color. 


Proceed in a way similar to the solution of the previous problem. Perform the color 
coding step with t colors on the set V(G). Then, a colorful solution is a set X of at most 
k vertices such that NG[X] contains at least one vertex of every color. 


5.13] First, observe that, without loss of generality, we may look for an induced subgraph 
with exactly k vertices. Second, show that a random coloring of G into r colors correctly 
colors the subgraph in question with good probability. Finally, observe that, given a colored 
instance, it suffices to find a maximum size independent set in each color class indepen- 
dently. 


Prove that a random coloring of V(G) into k colors does the job with good probability. 


5.15] Let A be a subgraph of G isomorphic to H. A coloring x : V(G) > {R, B} is 
successful if V(H) C x~1(R) but No(V(H)) C x~1(B). Observe that |V(H)| = k and 
\Na(V(H))| < dk by the degree bound. The first point follows by using a brute-force 
algorithm to check isomorphism in the construction of the graph B(H, Gpr) where Gr = 
G[x71(R)]. : 

For the second point, the key idea is to bias the probability: since our bound for |V(H)| 
is much better than the one for INc(V(A))|, we may color independently every vertex red 
with probability 1/d and blue with probability 1—1/d. In this manner, x is successful with 
probability: 


1\ VI 1\ INe v 8))| 1\e 
= (lee >d-*.(1-—=) sd taO, 
d d = d 
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5.16] Perform chromatic coding as in the case of d-CLUSTERING. In the second phase, use 
the fact that every n-vertex split graph has only O(n) possible partitions of the vertex set 
into the clique and independent set parts. To obtain the 20 (VYF log k)pO(1) running time 
bound, you need to either obtain a polynomial kernel for the problem (which is done in 
Exercise B-36), or wrap the algorithm with iterative compression. 


The crucial observation is the following: every n-vertex graph on at most k edges 
has at most 20(V*)n subgraphs being complete graphs. To see this claim, consider the 
ordering v1, v2,...,Un of V(G) defined in the proof of Lemma[5.10] To identify a complete 
subgraph H of G, it suffices to give (a) a vertex v; E€ V(H) of lowest possible index (n 
choices), and (b) the set V(H) \ {ui} = Na (vi) NV(Gi) (2% = 2°(VF) choices, as v; has 
degree d; in V(G;_-1) and d; = O(Vk)). 

To utilize the aforementioned observation, proceed with iterative compression, adding 
vertices one by one to the graph G. Assume we have a solution F to SPLIT EDGE DELETION 
on (G — v,k) and we would like find a solution F’ to SPLIT Epee DELETION on (G, k). 
Furthermore, let V(G) \ {v} = CWI be (any) partition of the vertex set of the split graph 
G—v-—F into the clique and independent set parts. Observe that, if V(G) = C’ w T’ is 
(any) partition of the vertex set of the split graph G — F” into the clique and independent 
set parts, then J \ I’ induces a clique in G. Moreover, E(G[I]) C F and, consequently, 
Gl] has at most k edges. Thus, there are 20(V*)n choices for J \ I’. Hence, we can guess 
I\ T, as well as whether the new vertex v belongs to C’ or I’. With this information, it 
is straightforward to deduce the remaining parts of C’ and I’. 


Observe that ifn > ¢!7! in a (n, 2, ¢)-splitter F, then there are two elements a, b € [n] 
such that for every f € F we have f(a) = f(b). 


Observe that you need the correct partition of E(H) U I’, which is of size at most 
dk. Thus, you need a (|E£(G)|, p)-universal set for every k < p < dk. 


Observe that you need the correct partition of No[V (B), which is of size at most 
(d+ 1)k. Thus, you need a (|V(G)|, p)-universal set for every k < p < (d+ 1)k. 


The biased separation, used in the solution for Exercise[5.15] can be usually efficiently 
derandomized using a splitter from Theorem[5.16] _ 

Recall that we need to correctly separate NG|V(H)], which is of size at most (d+ 1)k. 
We first guess p = |Nc[V(A)]| < (d+ 1)k and we construct a (n,p,p”)-splitter F of 
polynomial size. Then, we iterate through every function f € F and every set X C [p?] of 
size exactly k and consider a coloring x : V(G) > {R, B} defined as: x(v) = R if and only 
if f(v) € X. By the definition of a splitter, there exists f € F that is injective on Ng|V(H)]| 
and we consider a set X = f(V(f)). For this choice of f and X, the coloring x is successful. 


Finally, note that the size of F is polynomial, whereas there are (°°) = 20(k (log d+log k)) 
choices for the set X. 

Intuitively, in this approach we use a generic splitter whose size is negligible in the 
running time, and then we mimic the bias of the separation by guessing a small set in the 
(largish) codomain of the splitter. 
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Chapter 6 
Miscellaneous 


In this chapter, we gather a few algorithmic tools 
that we feel are important, but do not fit into any of 
the previous chapters. First, we discuss exponential- 
time dynamic-programming algorithms that consider 
all subsets of a certain set when defining the subprob- 
lems. Second, we introduce the INTEGER LINEAR PRo- 
GRAMMING FEASIBILITY problem, formulate the clas- 
sical results on how to solve it in the case of a bounded 
number of variables, and show an example of its ap- 
plication in fixed-parameter algorithms. Finally, we 
discuss the algorithmic implications of the Robertson- 
Seymour theorem on graph minors. 





The chapter consists of three independent sections, each tackling a dif- 
ferent tool in parameterized algorithms. Since a full exposition of, say, the 
algorithmic applications of the graph minors project would constitute a large 
volume in itself, we have decided to give only a glimpse of the topic in each 
of the three sections. 

In Section we present a common theme in many exponential-time 
algorithms: a dynamic-programming algorithm, where the subproblems are 
defined by considering all subsets of a certain set of elements (and hence 
the number of subproblems and the running time are exponential in the 
number of elements). As a first example, we give a simple 2!7! (U| + |F) ®- 
time dynamic-programming algorithm for SET COVER with a family of sets 
F over a universe U. Then, we obtain a 3!*!nO@ algorithm for STEINER 
TREE, where K is the set of terminal vertices. Using additional ideas, we will 
improve the running time of this algorithm in Section [10.1.2] 

Section introduces yet another tool to design fixed-parameter algo- 
rithms, namely integer linear programs. In turns out that many NP-hard 
problems can be expressed in the language of INTEGER LINEAR PROGRAM- 
MING. For example, in Section [2.5] we have seen how a VERTEX COVER in- 
stance can be encoded as an INTEGER LINEAR PROGRAMMING instance. 
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In 1983, Lenstra showed that INTEGER LINEAR PROGRAMMING is fixed- 
parameter tractable when parameterized by the dimension of the space, i.e., 
the number of variables. (Naturally, Lenstra did not use the terminology of 
fixed-parameter tractability, as it was introduced much later.) Thus, Lenstra’s 
result gives us a very general tool for proving fixed-parameter tractability of 
various problems. Note that a similar phenomenon happens in the world of 
polynomial-time algorithms, where a large number of tractable problems, in- 
cluding the problems of finding a shortest path or a maximum flow, can be 
represented in the language of LINEAR PROGRAMMING. In Section we 
state the fastest known algorithm for INTEGER LINEAR PROGRAMMING, and 
exemplify its usage on the IMBALANCE problem. 

In Section [6.3] we move to the theory of graph minors. The Graph Minors 
project of Robertson and Seymour, developed in the last three decades, re- 
sulted not only in achieving its main goal — proving Wagner’s conjecture, 
asserting that the class of all graphs is well-quasi-ordered by the minor re- 
lation — but also flourished with other tools, techniques and insights that 
turned out to have plenty of algorithmic implications. In this chapter, we re- 
strict ourselves only to the implications of the Robertson-Seymour theorem 
in parameterized complexity, and we show how theorems from Graph Minors 
immediately imply fixed-parameter tractability of such problems as detecting 
the Euler genus of a graph. Robertson-Seymour theory also gives us pretext 
to discuss the notion of nonuniform fixed-parameter algorithms. We remark 
here that the next chapter, Chapter [?| is entirely devoted to the algorithmic 
usage of the notion of treewidth, a different offspring of the Graph Minors 
project. 


6.1 Dynamic programming over subsets 


In this section, we give two examples of dynamic-programming algorithms 
over families of sets. Our examples are SET COVER and STEINER TREE. 


6.1.1 SET COVER 


Let F be a family of sets over a universe U. For a subfamily F’ C F anda 
subset U’ C U, we say that F’ covers U’ if every element of U’ belongs to 
some set of F’, that is, U’ C |J F’. In the SET Cover problem, we are given 
a family of sets F over a universe U and a positive integer k, and the task 
is to check whether there exists a subfamily F’ C F of size at most k such 
that F’ covers U. We give an algorithm for SET COVER that runs in time 
2lUI (U| + |F|)O™. In fact, this algorithm does not use the value of k, and 
finds the minimum possible cardinality of a family F’ C F that covers U. 
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Theorem 6.1. Given a SET COVER instance (U,F,k), the minimum pos- 
sible size of a subfamily F' C F that covers U can be found in time 


AEU + |FI)OM. 


Proof. Let F = {F,, F2,..., Fiz \}- We define the dynamic-programming ta- 
ble as follows: for every subset X C U and for every integer 0 < j < |F|, we 
define T|X, j] as the minimum possible size of a subset F’ C {F\, Fo,..., Fj} 
that covers X. (Henceforth, we call such a family F’ a valid candidate for 
the entry T[X, j].) If no such subset F’ exists (i.e., if X Z U F;), then 
TIX, j] = +00. 

In our dynamic-programming algorithm, we compute all 2!4!(|F| +1) val- 
ues T[X, j]. To achieve this goal, we need to show (a) base cases, in our 
case values T[X, 7] for j = 0; (b) recursive computations, in our case how to 
compute the value T|X, j] knowing values T[X’, 7’] for j’ < j. 

For the base case, observe that T[Ø, 0] = 0 while TLX, 0] = +00 for X # 0). 

For the recursive computations, let X C U and 0 < j < |F|; we show that 


T|X, j] = min(T[X, j — 1],1+T7[X \ Fj, j — 1). (6.1) 


We prove by showing inequalities in both directions. In one direction, 
let F’ C {F\, Fo,...,F;} be a family of minimum size that covers X. We 
distinguish two cases. If F; ¢ F’, then note that F’ is also a valid candidate 
for the entry T|X, j — 1] (ie., F” C {F1, F2,..., Fj-1} and F’ covers X). If 
F; € F', then F’\ {F;} is a valid candidate for the entry T[X\ F}, j— 1]. In the 
other direction, observe that any valid candidate F’ for the entry TX, j — 1] 
is also a valid candidate for T[X, j| and, moreover, for every valid candidate 
F' for T|X \ Fj, j — 1], the family F’ U {F;} is a valid candidate for TLX, j]. 
This finishes the proof of (6.1). 

By using (6.1), we compute all values T|X, j| for X CU and 0 <j < |F 
within the promised time bound. Finally, observe that T[U, |F|] is the answer 
we are looking for: the minimum size of a family F’ C {F,, Fo,..., Fiz} =F 
that covers U. 














We remark that, although the dynamic-programming algorithm of Theo- 
rem [6.1] is very simple, we suspect that the exponential dependency on |U], 
that is, the term 2!4!, is optimal. However, there is no known reduction that 
supports this claim with the Strong Exponential Time Hypothesis (discussed 


in Chapter [14}. 


6.1.2 STEINER TREE 


Let G be an undirected graph on n vertices and K C V(G) bea set of termi- 
nals. A Steiner tree for K in G is a connected subgraph H of G containing 
K, that is, K C V(H). As we will always look for a Steiner tree of minimum 
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possible size or weight, without loss of generality, we may assume that we 
focus only on subgraphs H of G that are trees. The vertices of V(H) \ K 
are called Steiner vertices of H. In the (weighted) STEINER TREE problem, 
we are given an undirected graph G, a weight function w: E(G) > Rso and 
a subset of terminals K C V(G), and the goal is to find a Steiner tree H for 
K in G whose weight w(H) = >) .cx(#) W(e) is minimized. Observe that if 
the graph G is unweighted (i.e., w(e) = 1 for every e € E(G)), then we in 
fact optimize the number of edges of H, and we may equivalently optimize 
the number of Steiner vertices of H. 

For a pair of vertices u,v € V(G), by dist(u,v) we denote the cost of a 
shortest path between u and v in G (i.e., a path of minimum total weight). 
Let us remind the reader that, for any two vertices u, v, the value dist(u, v) is 
computable in polynomial time, say by making use of Dijkstra’s algorithm. 

The goal of this section is to design a dynamic-programming algorithm for 
STEINER TREE with running time 3!K!nO™, where n = |V(G)|. 

We first perform some preprocessing steps. First, assume |K| > 1, as oth- 
erwise the input instance is trivial. Second, without loss of generality, we 
may assume that G is connected: a Steiner tree for K exists in G only if all 
terminals of K belong to the same connected component of G and, if this is 
the case, then we may focus only on this particular connected component. 
This assumption ensures that, whenever we talk about some minimum weight 
Steiner tree or a shortest path, such a tree or path exists in G (i.e., we do 
not minimize over an empty set). Third, we may assume that each terminal 
in K is of degree exactly 1 in G and its sole neighbor is not a terminal. To 
achieve this property, for every terminal t € K, we attach a new neighbor t 
of degree 1, that is, we create a new vertex t and an edge tt’ of some fixed 
weight, say 1. Observe that, if |K| > 1, then the Steiner trees in the original 
graph are in one-to-one correspondence with the Steiner trees in the modified 
graphs. 

We start with defining a table for dynamic programming. For every 
nonempty subset D C K and every vertex v € V(G) \ K, let T[D,v] be 
the minimum possible weight of a Steiner tree for DU {v} in G. 


The intuitive idea is as follows: for every subset of terminals D, and 
for every vertex v € V(G) \ K, we consider the possibility that in the 
optimal Steiner tree H for K, there is a subtree of H that contains 
D and is attached to the rest of the tree H through the vertex v. For 
|D| > 1, such a subtree decomposes into two smaller subtrees rooted at 
some vertex u (possibly u = v), and a shortest path between v and u. 
We are able to build such subtrees for larger and larger sets D through 
the dynamic-programming algorithm, filling up the table T[D, v]. 


The base case for computing the values T[D, v] is where |D| = 1. Observe 
that, if D = {t}, then a Steiner tree of minimum weight for DU {v} = {v, t} 
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is a shortest path between v and t in the graph G. Consequently, we can fill 
in T[{t}, v] = dist(t, v) for every t € K and v E€ V(G)\ K. 

In the next lemma, we show a recursive formula for computing the values 
T|D, v] for larger sets D. 


Lemma 6.2. For every D C K of size at least 2, and every v € V(G)\ K, 
the following holds 


T[D,v| = pe {T[D',u] + T[D \ D',u] + dist(v, u)}. (6.2) 
0AD'CD 


Proof. We prove by showing inequalities in both directions. 

In one direction, fix u € V(G) and Ø # D’ © D. Let H, be the tree 
witnessing the value T[D’, u], that is, H; is a Steiner tree for D’ U {u} in G 
of minimum possible weight. Similarly, define Hp for the value T[D \ D’, ul. 
Moreover, let P be a shortest path between v and u in G. Observe that 
Hı U Hə U P is a connected subgraph of G that contains D U {v} and is of 
weight 


w(H,UA2UP) < w(A1)+w(H2)+w(P) = T[D’, uj] +T[D\ D’, u]+dist(v, u). 
Thus 


T[D,v] < mi TD’, u] + T[D \ D’,u] + dist(v, u)}. 
[D, v pe e [D’,u] + T[D \ D', u] + dist(v, u)} 
04D'CD 


In the opposite direction, let H be a Steiner tree for DU {v} in G of mini- 
mum possible weight. Let us root the tree H in the vertex v, and let uo be the 
vertex of H that has at least two children and, among such vertices, is closest 
to the root. An existence of such a vertex follows from the assumptions that 
|D| > 2 and that every terminal vertex is of degree 1. Moreover, since every 
terminal of K is of degree 1 in G, we have uo ¢ K. Let u be an arbitrarily 
chosen child of uo in the tree H. We decompose H into the following three 
edge-disjoint subgraphs: 


1. P is the path between ug and v in H; 

2. Hı is the subtree of H rooted at u1, together with the edge uguy; 

3. Hə consists of the remaining edges of H, that is, the entire subtree of H 
rooted at uo, except for the descendants of u1 (that are contained in H1). 
See Fig. 

Let D’ = V(H1) NK be the terminals in the tree H4. Since every terminal 
is of degree 1 in G, we have D \ D’ = V(H2) K. Observe that, as H is of 
minimum possible weight, D’ 4 0, as otherwise H \ Hı is a Steiner tree for 
DU {v} in G. Similarly, we have D’ Ç D as otherwise H \ Hə is a Steiner 
tree for D U {v} in G. Furthermore, note that from the optimality of H it 
follows that w(H,) = T[D’, uo], w(H2) = T[D \ D’, uo] and, moreover, P is 
a shortest path between ug and v. Consequently, 
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uo 


U1 


Hı Hə 


Fig. 6.1: Decomposition of H 


T[D, v] = w(H) = T[D’, uo] + T[D \ D’, uo] + dist(v, uo) 
> i T|D' u] + T[D \ D',u] + dist(v, u)}. 
> amin „{TD',u] + TID\ D'u] + dist(o,u)} 
04D'CD 














This finishes the proof of the lemma. 


With the insight of Lemma [6.2| we can now prove the main result of this 
section. 


Theorem 6.3. STEINER TREE can be solved in time 3|KlnOM, 


Proof. Let (G, w, K) be an instance of STEINER TREE after the preprocessing 
steps have been performed. We compute all values of T[D, v] in the increasing 
order of the cardinality of the set D. As discussed earlier, in the base case we 
have T[{t}, v] = dist(t, v) for every t € K and v € V(G) \ K. For larger sets 
D, we compute T[D, v] using (6.2); note that in this formula we use values 
of T[D', u] and T[D \ D’,u], and both D’ and D \ D’ are proper subsets of 
D. In this manner, a fixed value T[D, v] can be computed in time 2!?/nO™, 
Consequently, all values T[D,v] are computed in time 


IKI 


DED nO <n Gy ne — 311,00), 
j 


veV(G)\K DCK j=2 


Finally, observe that, if the preprocessing steps have been performed, then 
any Steiner tree for K in V(G) needs to contain at least one Steiner point 
and, consequently, the minimum possible weight of such a Steiner tree equals 
min,cv(q)\K TK, v]. 


We will see in Section [10.1-2]how the result of Theorem |[6.3|can be improved. 














6.2 INTEGER LINEAR PROGRAMMING 135 


6.2 INTEGER LINEAR PROGRAMMING 


In this section, we take a closer look at parameterized complexity of INTEGER 
LINEAR PROGRAMMING. 

We start with some definitions. In the INTEGER LINEAR PROGRAMMING 
FEASIBILITY problem, the input consists of p variables 71,%2,...,@%p, and a 
set of m inequalities of the following form: 














41101 T 1,202 sae Q1 ptp < by 
Q2,1 %1 T A2,2%2 T... T a2 nXp < bo 
am,1£1 + am, 2£2+ ... + am, plp < bm 


where every coefficient a; ; and b; is required to be an integer. The task is to 
check whether one can choose integer values for every variable x; so that all 
inequalities are satisfiable. 

Equivalently, one may look at an INTEGER LINEAR PROGRAMMING FEA- 
SIBILITY instance as a matrix A € Z™*? and a vector b € Z™; the task is 
to check whether there exists a vector x € ZP such that Ax < b. We assume 
that an input of INTEGER LINEAR PROGRAMMING FEASIBILITY is given in 
binary and thus the size of the input is the number of bits in its binary 
representation. 

In typical applications, when we want to solve a concrete algorithmic prob- 
lem by formulating it as INTEGER LINEAR PROGRAMMING FEASIBILITY, we 
may assume that the absolute values of the variables in the solution are poly- 
nomially bounded by the size n of the instance we want to solve. Therefore, if 
the INTEGER LINEAR PROGRAMMING FEASIBILITY instance has p variables, 
then we may solve it in time n°) by brute force. The main technical tool 
that we use in this section is the fact that INTEGER LINEAR PROGRAMMING 
FEASIBILITY is fixed-parameter tractable parameterized by the number of 
variables. This opens up the possibility of obtaining FPT results by trans- 
lating the problem into an instance of INTEGER LINEAR PROGRAMMING 
FEASIBILITY with bounded number of variables. 











Theorem 6.4 ([280], ; ). An INTEGER LINEAR PROGRAMMING 
FEASIBILITY instance of size L with p variables can be solved using O(p??+°). 
L) arithmetic operations and space polynomial in L. 


In other words, Theorem says that INTEGER LINEAR PROGRAMMING 
FEASIBILITY is fixed-parameter tractable when parameterized by p, with a 
relatively good (slightly super-exponential) dependence on the parameter and 
linear dependence on the input size. 

In some applications, it is more convenient to work with an optimiza- 
tion version of the INTEGER LINEAR PROGRAMMING FEASIBILITY problem, 
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namely INTEGER LINEAR PROGRAMMING. Although this problem has been 
already briefly discussed in Section [2.5] let us now recall its definition. In the 
INTEGER LINEAR PROGRAMMING problem, apart from the standard input 
for INTEGER LINEAR PROGRAMMING FEASIBILITY (i.e., a matrix A € Z™*? 
and a vector b € Z™) we are given a vector c € ZP, and our goal is to find 
a vector x € ZP satisfying all the aforementioned inequalities (i.e., Ax < b) 
that minimizes the objective function c- x (the scalar product of c and 2). 

Using binary search, it is easy to derive an algorithm for INTEGER LINEAR 
PROGRAMMING using Theorem |6.4 


Theorem 6.5. An INTEGER LINEAR PROGRAMMING instance of size L with 
p variables can be solved using 


O(p2- Pt) - (L + log Mz) log(M,M.)) 


arithmetic operations and space polynomial in L + log Mz, where M, is an 
upper bound on the absolute value a variable can take in a solution, and M, 
is the largest absolute value of a coefficient in the vector c. 


Proof. Observe that the absolute value of the objective function is at most 
pM, M, as long as the variables have absolute values at most M,. We perform 
a binary search to find the minimum value of the objective function. That is, 
for a fixed integer threshold —pM,M, < t < pM,M.-, we add an inequality 
cx < t to the system Ax < b and apply the algorithm of Theorem [6.4] to 
the obtained INTEGER LINEAR PROGRAMMING FEASIBILITY instance. The 
instance has size O(L + plog(pM,M.)) = O(p(L + log M,,)), and hence each 
application of Theorem runs in time O(p?5P+e() . (L + log M,)). Con- 
sequently, we are able to find an optimum value to of the objective function 
within the promised bound on the running time. Moreover, any solution to 
the INTEGER LINEAR PROGRAMMING FEASIBILITY instance consisting of the 
system Ax < b with an additional inequality cx < to is an optimal solution 
to the input INTEGER LINEAR PROGRAMMING instance. 

















6.2.1 The example of IMBALANCE 


Now we exemplify the usage of Theorem [6.5] on the IMBALANCE problem. 
In order to define the problem itself, we need to introduce some no- 
tation. Let G be an n-vertex undirected graph. An ordering of V(G) is 
any bijective function 7: V(G) > {1,2,...,n}. For v € V(G), we define 
L,(v) = {ue N(v) : n(u) < r(v)} and R,(v) = {u € N(v) : a(u) > 
m(u)} = N(v) \ L,(v). Thus L,(v) is the set of vertices preceding v, and 
R,(v) is the set of vertices succeeding v in m. The imbalance at vertex v is 
defined as t,(v) = ||Lr(v)| —|Rx(v)||, and the imbalance of the ordering m 


equals (7) = Dyev(q) tz (v)- 
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In the IMBALANCE problem, parameterized by the vertex cover number 
of a graph, we are given a graph G together with its vertex cover X of size 
k, and the task is to find an ordering 7 of V(G) minimizing its imbalance. 
The parameter is k, the size of the provided vertex cover, as opposed to — 
maybe more natural after most of the previous examples in this book — 
the parameterization by solution size (the value of the objective function, 
the imbalance of the ordering in question). We remark that there is an FPT 
algorithm for IMBALANCE parameterized by the imbalance of the ordering, 
but such an algorithm is not the goal of this section. 

The IMBALANCE problem, parameterized by the vertex cover number, falls 
into the wide topic of so-called structural parameterizations. Here, instead of 
taking the most natural “solution size” parameterization (as was the case, e.g., 
for VERTEX COVER or FEEDBACK VERTEX SET in previous chapters), we 
pick as a parameter some structural measure of the instance (most usually, 
a graph) at hand. In parameterized complexity, the choice of the parame- 
ter comes from the application: we would like to solve efficiently instances 
for which the parameter is small. Hence, studying a computational problem 
both using the “solution size” parameterization and different structural pa- 
rameters, such as the vertex cover number, refines our understanding of the 
matter by considering complementary angles: we design efficient algorithms 
for different classes of input instances. Furthermore, from a theoretical point 
of view, different parameterizations often offer different interesting insights 
into the studied problem. This is yet another example of flexibility provided 
by parameterized complexity in choosing parameters. 

We remark that this is not the only example of structural parameterization 
in this book. The entire Chapter [7]is devoted to treewidth, an important graph 
parameter that measures the resemblance of a graph to a tree. Moreover, in 
Section [15.2.4] we show an example of kernelization lower bound for CLIQUE, 
parameterized by the vertex cover number. 

We remark that in the case of parameterization by the vertex cover num- 
ber, it is not necessary that we require that some vertex cover is provided 
along with the input graph, but instead we can compute a 2-approximation 
of a minimum vertex cover. In this manner, we obtain a parameter that is 
at most twice as large. Another approach would be to use an FPT algorithm 
to compute a vertex cover of size at most k. However, in the case of several 
other structural parameters of the input graph (e.g., the size of a dominating 
set) we do not have a good approximation or FPT algorithm to fall back on, 
and hence, in this book, we define all problems with structural parameters 
by insisting on providing the promised structure in the input. 

We now show how to apply Theorem |6.5]in order to give an FPT algorithm 
for IMBALANCE parameterized by the size of the provided vertex cover of G. 

We are looking for an ordering 7 for which (7) is minimized. In order to do 
this, we loop over all possible orderings (bijections) mx: X —> {1,2,...,k} of 
the vertex cover X and, for every such ordering mx, we find the best ordering 
n of V(G) that agrees with mx: We say that a and mx agree if for all u,v € X 
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we have that mx(u) < mx(v) if and only of n(u) < m(v). In other words, the 
relative ordering 7 imposes on X is precisely 7x. Thus, at a cost of a factor of 
k! in the running time, we can assume that there exists an optimal ordering 7 
such that X = {u1, u2,..., Uk} and m(ui) < (uz) <... < 7(ux). For every 
0<i<k, we define X; = {u,ua,..., Ui}. 

Because X is a vertex cover, the set I = V(G) \ X is independent. We 
associate a type with each vertex in I as follows. 


Definition 6.6. The type of a vertex v € I is the set N(v) C X. For a type 
S C X, the set I(S) is the set of all vertices in I of type S. 


Notice that the number of different types does not exceed the number of 
subsets of X, which is 2”. 

Observe that every vertex of I is either mapped between two vertices of 
X, to the left of u1, or to the right of uz by an optimal ordering m. We say 
that a vertex v € I is at location 0 if m(v) < m(u,) and at location i if i is 
the largest integer such that m(u;) < m(v). The set of vertices that are at 
location 7 is denoted by L;. We define the inner order of m at location i to 
be the restriction of 7 to Li. 

The task of finding an optimal permutation can be divided into two parts. 
The first part is to partition the set J into [o,..., Lp, while the second part 
consists of finding an optimal inner order at all locations. One should notice 
that partitioning J into Lo,..., Lp amounts to deciding how many vertices of 
each type are at location i for each i. Moreover, observe that the second part 
of the task, namely permuting the inner order of m at location i, in fact does 
not change the imbalance at any single vertex. This is due to the fact that 
the vertices of J, and thus all vertices at location 7, are pairwise nonadjacent. 
Hence, the inner orders are in fact irrelevant and finding the optimal ordering 
of the vertices thus reduces to the first part of finding the right partition of 
I into Lo,..., Lp. Our goal for the rest of this section is to phrase this task 
as an INTEGER LINEAR PROGRAMMING instance. 

For a type S and location i, we let ee be a variable that encodes the 
number of vertices of type S that are at location i. Also, for every vertex 
ui in X, we introduce a variable y; that represents the lower bound on the 
imbalance of u;. 

Let us now describe the inequalities. First, in order to represent a feasible 
permutation, all the variables must be nonnegative. Second, the variables A 
have to be consistent with the fact that we have |I(S)| vertices of type S, 
that is, Tia xy = |I(S)| for every type S. 

For every vertex u; of the vertex cover X, we let e; = |N (wi) O Xi-1| — 
|N(u;) N (X \ X;)|. For every u; € X we have a constraint 





i—1 k 
Yi Z e+) EEDE ; (6.3) 
scx \j=0 j=i 


wes 
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Finally, for every type S and location i, let the constant z$ be equal to the 
imbalance at a vertex of type S if it is placed at location i. That is, 


z5 =||SN Xi] = 50 (X \ XI. 


We are now ready to formulate our INTEGER LINEAR PROGRAMMING in- 
stance. 


min SuD 5 gia 


i=0 SCX 


s.t. Sai = |I(S)| VS CX 


y2et > Se Da YI<i<k 





scx \j=0 
uEsS 
Yi Z —ei — 5 DA Da VI<i<k 
scx \j=0 
wieS 
xy > 0 VO<i<k, SCX. 


A few remarks are in place. First, since is not a linear constraint, we have 
represented it as two constraints in the INTEGER LINEAR PROGRAMMING 
instance. 

Second, formally, in the integer linear program above, we do not require y; 
to be exactly the imbalance at u;, but to be at least this imbalance. However, 
the minimization criterion forces y; to be actually equal to the imbalance at 
Ui. 

Finally, observe that the maximum possible value of a variable x4, is less 
than n, the maximum possible value of a variable y; in an optimal solution 
(where the inequality is in fact an equality) is less than n, and the maxi- 
mum absolute value of a coefficient z% is, again, less than n. Consequently, an 
appieatan a ee our INTEGER LINEAR PROGRAMMING instance 
runs in 22° n° time, and we obtain the following result. 


Theorem 6.7. The IMBALANCE problem, parameterized by the size of a pro- 
vided vertex cover of the input graph, is fixed-parameter tractable. 
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6.3 Graph minors and the Robertson-Seymour theorem 


In this section, we discuss the minor relation in graphs and the algorithmic 
usage of the Robertson-Seymour theorem. 

For a graph G and an edge uv € G, we define the operation of contracting 
edge uv as follows: we delete vertices u and v from G, and add a new vertex 
Wuv adjacent to (Ne(u) U Ne(v)) \ {u,v} (ie., to all vertices that u or v 
was adjacent to in G). Observe that contraction defined as above does not 
introduce any multiple edges or loops (note that we used a different definition 
when treating FEEDBACK VERTEX SET in Section [3.3). Let G/uv be the 
graph obtained by contracting edge uv in G. 

We say that a graph H is a minor of G, denoted by H <,, G, if H 
can be obtained from some subgraph of G by a series of edge contractions. 
Equivalently, we may say that H is a minor of G if H can be obtained from 
G itself by a series of edge deletions, edge contractions and vertex deletions. 

There is also a different definition that in some cases is more convenient: 
H is a minor of G if for every h € V(H) we can assign a nonempty branch 
set Vn C V(G), such that 


(a) G[Vp] is connected; 

(b) for different g,h € V(H), the branch sets V} and V, are disjoint; and 

(c) for every gh € E(#) there exists an edge vgv;, € E(G) such that vg € Vy 
and up € Vp. 


Such a family (Vi)nevim) of branch sets is called a minor model of H in G. 
Exercise [6.12] asks the reader to check the equivalence of the aforementioned 
definitions. 

The minor relation in some sense preserves the topological properties of a 
graph, for example, whether a graph is a planar graph. 

Let us now clarify what we mean by a planar graph, or a graph embedded 
into the plane. First, instead of embedding into the plane we will equivalently 
embed our graphs into a sphere: in this manner, we do not distinguish unnec- 
essarily the outer face of the embedding. Formally, an embedding of a graph 
G into a sphere is a mapping that maps (a) injectively each vertex of G into a 
point of the sphere, and (b) each edge wv of G into a Jordan curve connecting 
the images of u and v, such that the curves are pairwise disjoint (except for 
the endpoints) and do not pass through any other image of a vertex. A face 
is a connected component of the complement of the image of G in the sphere; 
if G is connected, each face is homeomorphic to an open disc. A planar graph 
is a graph that admits an embedding into a sphere, and a plane graph is a 
planar graph together with one fixed embedding. 

It is easy to observe the following. 


Proposition 6.8. A minor of a planar graph is also planar. 
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Fig. 6.2: Torus 


Indeed, removing edges and vertices surely cannot make a planar graph non- 
planar and it is not difficult to modify a planar embedding to express the 
contraction of an edge. 

A statement similar to Proposition [6.8] holds when we generalize planar 
graphs to graphs drawn on some other, more complicated, but fixed surface. 
In this chapter, we discuss graphs drawn on surfaces only very briefly and 
informally, as they only serve here as a demonstration of certain concepts in 
the theory of graph minors. It is well known that a graph can be drawn on 
the plane if and only if it can be drawn on the sphere. However, there are 
graphs that can be drawn on the torus (the doughnut-shaped surfaced shown 
on Fig. (6.2), but not on the sphere — the clique Ks is such a graph. One 
way to construct surfaces is to add some number of “handles” to a sphere: for 
example, we may consider the torus as a sphere with a handle attached to it. 
There are other, more weird surfaces, such as the Klein bottle or the projective 
plane, that cannot be obtained this way and where we need to introduce an 
orientation-changing construction called a cross-cap, along with handles. 
We do not go into the exact definitions and details of these constructions, 
all we want to state here is that if a graph G is embeddable in a surface 
X, and H is a minor of G, then H is embeddable on X as well. For readers 
interested in Topological Graph Theory, we provide references to sources in 
the bibliographic notes at the end of this chapter. 

It will be very important for some of the applications that certain struc- 
tural parameters are monotone with respect to taking minors: for example, 
the minimum size of a vertex cover or feedback vertex set is not increased 
when taking the minor of a graph. The proof of the following proposition is 
left as an exercise (Exercise |6.13). 


Proposition 6.9. Let G be a graph, let X C V(G) be a vertex cover (feedback 
vertex set) of G, and let H be a minor of G. Then there exists a vertex cover 
(feedback vertex set) of H of size at most |X]. 
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Another example of a graph property that cannot increase if we go to a minor 
of a graph is treewidth, discussed in Chapter [7] 

Although the minor relation is already interesting in itself, the main thrust 
to its usability is given by the following seminal result of Robertson and 
Seymour. 


Theorem 6.10 (Robertson and Seymour). The class of all graphs is 
well-quasi-ordered by the minor relation. That is, in any infinite family of 
graphs, there are two graphs such that one is a minor of the other. 


The statement of the Robertson-Seymour theorem above is sometimes re- 
ferred to in the literature as the Wagner’s conjecture or the graph minors 
theorem. In what follows, we stick to the name Robertson-Seymour theorem. 
As we shall see in the rest of this section, the Robertson-Seymour theorem is 
a very powerful tool in parameterized complexity. 

Consider a class of graphs G. We say that G is closed under taking minors 
or minor-closed if for every G € G and every minor H of G, the graph H 
belongs to G as well. For example, by Proposition the class of planar 
graphs, or more generally, graphs embeddable on a surface of fixed genus g, 
is minor-closed. Proposition [6.9] asserts that, for every fixed k, the class of 
graphs admitting a vertex cover of size at most k or a feedback vertex set of 
size k, is minor-closed. 

We observe now the following crucial corollary of the Robertson-Seymour 
theorem. 


Corollary 6.11. For every minor-closed graph class G, there exists a finite 
set Forb(G) of graphs with the following property: for every graph G, graph G 
belongs to G if and only if there does not exist a minor of G that is isomorphic 
to a member of Forb(G). 


Proof. Define Forb(G) to be the set of minor-minimal elements of the comple- 
ment of G: a graph G is in Forb(G) if G is not in G, but every proper minor 
of G is in G. We keep only one representative of every isomorphism class. 
By definition, Forb(G) has the desired property; it remains to check only its 
finiteness. However, if Forb(G) is infinite, then the Robertson-Seymour the- 
orem implies that there exists H,G € Forb(G) such that H <m G. However, 
then G is not minimal with respect to the minor relation, contradicting the 
definition of Forb(G). 














The family Forb(G) is often called the family of minimal forbidden minors 
for G. 

For example, by Wagner’s theorem, a graph is planar if and only if it does 
not contain Ks and K3 3 as a minor. Thus for planar graphs Corollary [6.11] 
is not so interesting. However, already for the class of graphs embeddable in 
a torus (see Fig |6.2) the situation is not that clear. By Corollary we 
know that the family of forbidden minors for this class is finite, but their 
number can be enormous and it is not clear how we can find them efficiently. 
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Currently, more than 16,000 graphs are known from the family of forbidden 
minors for the class of graphs embeddable in the torus. More generally, for 
graphs embeddable on a surface of genus g, i.e., a sphere with g handles, 
we only know that the number of forbidden minimal minors for this class of 
graphs is bounded by some function of g. Similarly, by Corollary for 
every fixed k, the class of graphs admitting a vertex cover of size at most k 
has a forbidden family of minimal minors of size f(k) for some function f. 

It is important to note here that the proof of Corollary [6.11] is noncon- 
structive, i.e., while we know that set Forb(G) is finite, we do not know how 
to construct it and we do not know how to estimate its size. 

We need one more algorithmic result of the Graph Minors project. 


Theorem 6.12 (Robertson and Seymour). There exists a computable 
function f and an algorithm that, for given graphs H and G, checks in time 
f(H)|V(G)|> whether H <m G. 


Corollary with Theorem implies that every minor-closed class 
can be recognized in polynomial time. 


Theorem 6.13. Let G be a minor-closed graph class. There is a constant cg 
depending on the class G only, such that for any n-vertex graph G, deciding 
whether G € G can be done in time cg enè. 

Proof. By Corollary [6-11] there exists a family Forb(G) that characterizes G 
and whose size and the sizes of its elements are at most some constant cg 
depending only on G. Consider the following algorithm: iterate through all 
graphs H € Forb(G) and use Theorem [6.12] to check if H <,, G. If this is 
the case for at least one graph H € Forb(G), then G ¢ G and we answer that 
G ¢G. Otherwise, by the properties of Forb(G), we have that G € G. 

Let us bound the running time of the algorithm. By Theorem for 
every H € Forb(G), we check whether H is a minor of G in time f(H) - 
n? < f(cg) +n. The size of Forb(G) does not exceed cg, and by putting 
cg = cg : f (cg), we conclude the proof of the theorem. 














Let us now apply Theorem [6.13] to VERTEX COVER. Let (G,k) be a VER- 
TEX COVER instance. By Proposition the class G; of graphs admitting 
a vertex cover of size at most k is minor-closed. Thus by Theorem for 
every k > 0, there is an algorithm deciding if a graph G is in Gx in time cken’, 
where constant ck depends only Gk, that is cg = f(k) for some function f. 
Have we just shown another proof that VERTEX COVER is fixed-parameter 
tractable? 

Well, not exactly. Given only the integer k, we do not know the family 
Forb(G;,) — Corollary only asserts its existence and finiteness — and 
the knowledge of this family or at least its size is essential for further steps. In 
general, it seems hard to get around the problem of obtaining the forbidden 
minors: for example, it was shown by Fellows and Langston in that, 
given only oracle access to a membership test for a minor-closed class G, 
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one cannot compute Forb(G). However, we have at least proved the following 
statement, which is somewhat weaker than fixed-parameter tractability. It 
should be noted that we obtained this result without any algorithmic insight 
into the problem: we only used the general tool Theorem |6.13] and the fact 
that the vertex cover number does not increase when taking minors. 


Corollary 6.14. For every k, there is a constant cy, and an algorithm that, 
given an n-verter graph G, checks in time cp - n° if G admits a vertex cover 
of size at most k. 


Corollary |6.14]asserts the existence of a family of algorithms, one for every 
k, and the forbidden minors Forb(G;,) are hard-coded into the algorithm for 
the parameter k. Such a conclusion as in Corollary [6.14]is called nonuniform 
fized-parameter tractability. . 


Definition 6.15. We say that a parameterized problem Q is nonuniformly 
fized-parameter tractable if there exists a constant a, a function f : N —> N, 
and a collection of algorithms (A,),en such that the following holds. For 
every k € N and every input x, the algorithm A, accepts input «x if and only 
if (x, k) is a yes-instance of Q, and the running time of A, on x is at most 


F(k)|æ|®. 


The above notion should be contrasted with our definition of fixed-parameter 
tractability, where we require that there exists a single algorithm that takes 
k on input and works for all values of k. To emphasize the difference, our 
“standard” FPT algorithms are sometimes called (strongly) uniform. 

Let us now generalize Corollary [6.14] to a wider set of problems. Let G 
be a graph class. Many interesting problems can be defined as special cases 
of the following generic G VERTEX DELETION problem: for a given graph 
G and integer k, does there exist a set X of at most k vertices of G such 
that G — X € G? For example, when G is the class of graphs without edges, 
G VERTEX DELETION is VERTEX COVER. When G is the class of forests, G 
VERTEX DELETION is FEEDBACK VERTEX SET, and when G is planar, G 
VERTEX DELETION is PLANAR VERTEX DELETION, that is, the problem of 
whether a given graph G can be turned into a planar graph by deleting at 
most k vertices (we study this problem in detail in Section [*7.8}. 

From a graph-theoretical point of view, the following formulation seems 
more natural. For a graph class G and an integer k, let G + kv be a class of 
graphs defined as follows: 


GeG+kv if and only if 3X CV(G): (|X| <k)A(G-X €G). 





That is, G + kv are exactly these graphs G for which (G, k) is a yes-instance 
to G VERTEX DELETION. For this reason, the G VERTEX DELETION problem 
is sometimes called G + kv RECOGNITION. 

Proposition [6.9] generalizes to the following statement (see Exercise |6.14): 
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Proposition 6.16. For every minor-closed graph class G and for every inte- 
ger k, the class G + kv is also minor-closed. 


Let us note that the only property of VERTEX COVER we have used to 
derive Corollary is that the class of graphs with vertex cover at most k 
is minor-closed. Thus we have the following theorem. 


Theorem 6.17. For every minor-closed class G, the problem G VERTEX 
DELETION is nonuniformly fixed-parameter tractable, when parameterized by 


k. 


Although the notion of nonuniform fixed-parameter tractability evidently 
captures more problems than the standard, uniform onef] we believe that 
there is no significant difference among the “interesting” problems. In par- 
ticular, it is very likely that the theory of fixed-parameter intractability and 
W[1]-hardness described in Chapter [13] excludes also nonuniform FPT algo- 
rithms. That is, our conjecture is that no W[1|-hard problem is nonuniformly 
fixed-parameter tractable. 


Thus while Theorem does not bring us to an FPT algorithm, it is 
very convenient to use it to make an “educated guess” that a problem 
is FPT. 


In many cases, after using Theorem [6.13] to establish that a problem is 
nonuniformly FPT and hence very likely to be uniformly FPT as well, we 
can use other, more concrete techniques to prove that the problem is indeed 
uniformly FPT. For example, VERTEX COVER and FEEDBACK VERTEX SET 
are special cases of G VERTEX DELETION and thus by Theorem are 
nonuniformly FPT. On the other hand, we already saw FPT algorithms for 
these problems in Chapter |3| For PLANAR VERTEX DELETION, we give an 
FPT algorithm in sooner Another example is the GRAPH GENUS prob- 
lem: for a given graph G and integer k decide whether G can be embedded 
in a surface of genus k, i.e. a surface with k handles. The class of graphs 
of genus at most k is minor-closed and thus GRAPH GENUS is nonuniformly 
fixed-parameter tractable. An explicit FPT algorithm for GRAPH GENUS has 
been given by Mohar [365]. There are also some general techniques for mak- 
ing the algorithms resulting from Theorem [6.13] uniform, but they work only 
for a limited set of problems [5]. 

In Exercises and we give two other examples of usages of 
the Robertson-Seymour theorem: PLANAR DIAMETER IMPROVEMENT (does 
graph G have a planar supergraph with diameter at most k?) and LINKLESS 
EMBEDDING (does graph G have an embedding in three-dimensional space 





1 The standard examples for P vs. P / poly apply here: any undecidable unary language, 
parameterized by the length of the input, does not admit a (uniform) FPT algorithm (since 
it is undecidable), but there is a trivial nonuniform one, as every algorithm A; is supposed 
to handle only one input. 
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such that at most k cycles can link pairwise?). In both cases, the existence 
of a uniform FPT algorithm remains open. 


Exercises 


6.1 (Z). Using dynamic programming over the subsets, obtain an algorithm for CHRO- 
MATIC NUMBER on n-vertex graphs running in time 37M), 


6.2 (Z). Using dynamic-programming over subsets, show that the HAMILTONIAN CYCLE 
problem on an n-vertex graph can be solved in time 2"n0(), 


6.3. For an n x n matrix A = (ai j)1<i,j<n, the permanent of A is the value perm(A) = 
Xo Tie %,0(%)> Where the sum extends over all permutations o of {1,2,...,n}. (This is 
very similar to the definition of the determinant, but we do not have here the 1 or —1 factor 
depending on the number of inversions in the permutation.) Using dynamic programming 
over subsets, show how to compute the permanent of a given n x n matrix in time 2”n? 0), 


6.4. Using dynamic programming over subsets, show that DIRECTED FEEDBACK ARC SET 
on n-vertex graphs can be solved in time 2"n°), 


6.5 (8). Let G be an undirected bipartite graph on n vertices. Show that we can find a 
minimum size dominating set in G in time 7/220), 


6.6 (2). Show that CONNECTED VERTEX Cover admits an algorithm with running time 
6*nO), You might need to use an algorithm for STEINER TREE of Theorem [6.3] 


6.7. Given a directed graph G, a set of terminals K C V(G) and a root r € V(G), 
DIRECTED STEINER TREE asks for a directed tree rooted at r such that every terminal 
in K is reachable from r on the tree. Obtain a 3!*ln©()-time algorithm for DIRECTED 
STEINER TREE. 


6.8 (8). Consider the following restricted variant of STEINER TREE: assume G is a plane 
graph, and all terminals lie on the infinite face of G. Show how to enhance the dynamic- 
programming algorithm of Theorem[6.3]to run in polynomial time in this restricted case. 


6.9 (£2). Improve the algorithm of Theorem [6.3] so that the factor in the running time 
bound that depends on the size of the input graph equals the running time of a single- 
source shortest-path algorithm. That is, obtain a 3!*!|K|°(™ (n + m)-time algorithm for 
unweighted graphs and 3!*!|K|C)(nlogn + m)-time for the general, weighted case. 


In the next two exercises we revisit the CLOSEST STRING problem, which was already 
considered in Chapter Recall that in this problem we are given a set of k strings 
@1,22,...,% over alphabet X, each of length L, and an integer d. The task is to find 
a string y of length L such that the Hamming distance between y and x; is at most d, for 
every 1 < į < k. In Section we designed an FPT algorithm for this problem parameter- 
ized by d, while in Exercise|3.25]you were asked to design an algorithm that is faster when 
the alphabet size is small. This time, we consider the parameterization by k, the number 
of strings. 


6.10 (È). Prove that CLOSEST STRING is fixed-parameter tractable when parameterized 
by k and |X]. 
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6.11 (2). Refine the solution of the previous exercise to show that CLOSEST STRING is 
fixed-parameter tractable when parameterized by k only. 


6.12 (&). Prove that the two definitions of the minor relation, mentioned in Section [6.3] 
are equivalent. 


6.13 (Æ). Show that vertex cover and feedback vertex set are minor-closed parameters. 
In other words, for any minor H of graph G, if G admits a vertex cover (feedback vertex 
set) of size at most k, then H admits a vertex cover (feedback vertex set) of size at most 
k as well. 


6.14. Prove Proposition 


6.15 (@). In the Max Lear SuBTREE problem, we are given a graph G together with an 
integer k and the question is whether there is a subtree T of G with at least k leaves. 


1. Show that (G, k) is a yes-instance if and only if K1,, (a graph with a center vertex 
connected to k degree-one vertices) is a minor of G. 

2. Deduce that Max LEAF SUBTREE is nonuniformly fixed-parameter tractable, when 
parameterized by k. 


6.16. In the PLANAR DIAMETER IMPROVEMENT problem, the input consists of a planar 
graph G and an integer k, and the task is to check if there exists a supergraph of G that 
is still planar, and at the same time has diameter at most k. 


1. Prove that this problem is nonuniformly fixed-parameter tractable, when parameter- 
ized by k. 

2. Show that it suffices to consider supergraphs only G’ of G with V(G’) = V(G); that 
is, it only makes sense to add edges to G, and adding new vertices does not help. 


6.17. Let G be a graph embedded into R® (that is, every vertex of G corresponds to some 
point in R3, and every edge of G corresponds to some sufficiently regular curve connecting 
its endpoints; the edges/vertices do not intersect unless it is imposed by the definition). Two 
vertex-disjoint cycles Cı and C2 are said to be linked in this embedding if the corresponding 
closed curves in R? cannot be separated by a continuous deformation (i.e., they look like 
two consecutive links of a chain). A family C of pairwise vertex-disjoint cycles is pairwise 
linked if every two distinct cycles from the family are linked. In the LINKLEss EMBEDDING 
problem, given a graph G and an integer k, we ask whether there exists an embedding of 
G into R3 such that any pairwise linked family of cycles in G has size at most k. Prove 
that this problem is nonuniformly fixed-parameter tractable, when parameterized by k. 


6.18. In the Face Cover problem, we are given a planar graph G and an integer k, and 
the task is to check if there exists a planar embedding of G and a set of at most k faces 
in this embedding, such that every vertex of G lies on one of the chosen faces. Prove that 
this problem is nonuniformly fixed-parameter tractable, when parameterized by k. 


6.19. In the CycLe Packine problem the input is a graph G with an integer k and the 
task is to determine whether there exist k cycles in G that are pairwise vertex disjoint. 
Prove that this problem is nonuniformly fixed-parameter tractable, when parameterized 
by k. 
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Hints 


In the dynamic-programming table, for every X C V(G), define T[X] to be the 
minimum possible number of colors needed to color G[X]. For the base case, observe that 
T[X] = 1 if and only if X is a nonempty independent set in G. 


Fix a vertex s € V(G). In the dynamic-programming table, for every set X C V(G) 
that contains s, and every t € V(G) \ X, define T[X,t] to be a Boolean value indicating 
whether there exists in G a simple path with endpoints s and t and vertex set X U {t}. 


In the dynamic-programming table, for every X C {1,2,...,n}, define T[X] to be the 
permanent of the |X| x |X| matrix (4:,3)1<i<|x|jex- 


In the dynamic-programming table, for every X C V(G), define T[X] to be the 
minimum possible number of edges of G[X] whose deletion makes G[X] acyclic. For a 
recursive formula, for fixed set X, consider all possible choices of a vertex v € X that 
is the last vertex in the topological ordering of G[X] after the solution edges have been 
deleted. 


Let A and B be the bipartition sides of the input bipartite graph G and without loss 
of generality assume |A| < |B|, in particular, |A| < n/2. In general, perform a dynamic- 
programming algorithm like that of Theorem [6.1] for SET COVER instance with universe 
A and set family {Ng(b) : b € B}. However, to ensure that the side B is dominated, 
and at the same time allow some vertices of A to be chosen into a dominating set, change 
the base case as follows. Let D be a minimum size dominating set in G and observe that 
B\ Na(Dn A) C D. For every possible guess of D4 C A for the set DM A, we would 
like to allow in the dynamic-programming routine an option of taking D4 U(B\ Ng(Da)) 
into the solution; from the point of view of the SET Cover instance with universe A, this 
translates to covering D4 U Nc(B \ Ng(Da)) at cost |D4| + |B \ Ne@(Da)|. To achieve 
this goal, we compute the values T[X, 0] as follows. 


1. First set T[X,0] = +00 for every X C A. 

2. For every Da C A, define X = Da U Ne(B \ Nag(Da)) and set T[X,0] := 
min(T[X, 0], |DA] + |B \ No(Da)l)- 

3. For every X C A in the decreasing order of the size of X, and for every v € A\ X, 
set T[X, 0] := min(T[X, 0], TX U {v}, 0]). 


In this manner, T[X,0] equals a minimum possible size of a set DA U (B \ Ng(Da)) that 
dominates X, over all D4 C A. 


Observe that CONNECTED VERTEX COVER can be seen as a two-stage problem: first 
choose some vertex cover K of the input graph G, of size at most k, and then connect 
K using minimum size Steiner tree for K. Furthermore, note that the straightforward 
branching algorithm for VERTEX Cover runs in 2*n°(@) time and in fact outputs all 
inclusion-wise minimal vertex covers of G of size at most k. 


The main observation is as follows: it suffices to consider only sets D C K that consist 
of a number of terminals that appear consecutively on the infinite face of the input graph. 
In this way, there are only O(|K|?) sets D to consider. 


The main idea is to perform the computations in a different order: for a fixed choice 
of sets D’ and D, we would like to compute 


T’[D, D',v] = min T[D’,w]+T[D \ D’,w] + distg(v, w). 
[D, Div] = min TID! w] +TID\ Di, w] + distes(v, w) 


for all vertices v € V(G) \ K using a single run of a single source shortest path algorithm. 
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To this end, create an auxiliary graph G” as follows: create a new vertex s and for every 
w E€ V(G)\K create an edge sw of weight T[D’, w] + T[D \ D’, w]. Observe that for every 
v € V(G), we have distg(s,v) = T’[D, D’, v]. Hence, to compute the values T’[D, D’, v] 
for all vertices v, it suffices to run a single source shortest path algorithm from s in the 
graph G”. 

To obtain linear time in the unweighted case, observe that in this case all weights of 
the edges sw are positive integers of order O(n), and still a single source shortest path 
algorithm can be performed in linear time. 


We shall use integer linear programming, so as to avoid confusion with variables we 


use notation s1,s2,...,8, for the input strings. 
For every position 1 < j < L, let s;[j] be the j-th letter of the string si, and let 
si = (sı[j], s2[j],---, sk[j]) E€ XF be the tuple at position j. Define an INTEGER LINEAR 


PROGRAMMING instance using the following variables: for every s € XF and for every 
o € X, the variable £s, counts the number of positions j such that s’? = s and, in the 
solution s, the letter at position j is exactly o. 


Improve the approach of the previous exercise in the following way: we say that two 
tuples s,s’ € XF are equivalent if, for every 1 < i,i! < k, we have s(i) = s(i') if and only 
if s’(z) = s/(i’). Show that the variables corresponding to equivalent tuples can be merged 
(you need to be careful here with the index a) and you can have only k°(*) variables in 
your INTEGER LINEAR PROGRAMMING instance. 


Let G € G + kv and let X C V(G) be such that |X| < k and G — X € G. Let H be 
a minor of G, and let (Vh)hev(z) be a minor model of H in G. Define Y = {h € V(H) 
X N Vn AO}. Clearly, |Y| < |X| < k. Let Z = Upey Vn C V(G). Observe that G — Z is a 
subgraph of G — X. Moreover, (Vi)nev(H)\y is a minor model of H — Y in G — Z. Since 
G is minor-closed, we have H — Y € G and, consequently, H € G + kv. 


The main observation is that edge contraction cannot increase a diameter. Deduce 
from this fact that, for every fixed k, the class of graphs G for which (G, k) is a yes-instance 
to PLANAR DIAMETER IMPROVEMENT is minor-closed. 


Again, show that for fixed k, the class of graphs G for which (G, k) is a yes-instance 
is minor-closed. 
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The Graph Minors project of Robertson and Seymour spans over 23 papers, most of 
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There are several constructive algorithms for PLANAR VERTEX DELETION [358] [285] [277], 
with the current fastest algorithm by Jansen, Lokshtanov, and Saurabh having run- 
ning time kO(*)n, 

An embedding of a graph G into R3 is called linkless if every two vertex-disjoint cycles 
of G can be separated from each other by a continuous transformation (i.e., they do not 
look like two consecutive links of a chain). In the LINKLEss EMBEDDING problem, defined 
in Exercise[6.17] the graphs admitting a linkless embedding are yes-instances for k < 1. An 
explicit family of forbidden subgraphs for graphs admitting a linkless embedding was first 
conjectured by Sachs, and proved to be correct by Robertson, Seymour and Thomas [404]. 
There has been substantial work (e.g., and [267]) to understand the computational 
complexity of finding a linkless embedding of a graph. 


Chapter 7 
Treewidth 


The treewidth of a graph is one of the most frequently 
used tools in parameterized algorithms. Intuitively, 
treewidth measures how well the structure of a graph 
can be captured by a tree-like structural decomposition. 
When the treewidth of a graph is small, or equivalently 
the graph admits a good tree decomposition, then many 
problems intractable on general graphs become effi- 
ciently solvable. In this chapter we introduce treewidth 
and present its main applications in parameterized 
complexity. We explain how good tree decompositions 
can be exploited to design fast dynamic-programming 
algorithms. We also show how treewidth can be used 
as a tool in more advanced techniques, like shifting 
strategies, bidimensionality, or the irrelevant vertex 
approach. 





In Section [6-3] we gave a brief overview on how the deep theory of Graph 
Minors of Robertson and Seymour can be used to obtain nonconstructive 
FPT algorithms. One of the tools defined by Robertson and Seymour in 
their work was the treewidth of a graph. Very roughly, treewidth captures 
how similar a graph is to a tree. There are many ways to define “tree-likeness” 
of a graph; for example, one could measure the number of cycles, or the 
number of vertices needed to be removed in order to make the graph acyclic. 
However, it appears that the approach most useful from algorithmic and 
graph theoretical perspectives, is to view tree-likeness of a graph G as the 
existence of a structural decomposition of G into pieces of bounded size that 
are connected in a tree-like fashion. This intuitive concept is formalized via 
the notions of a tree decomposition and the treewidth of a graph; the latter 
is a quantitative measure of how good a tree decomposition we can possibly 
obtain. 

Treewidth is a fundamental tool used in various graph algorithms. In pa- 
rameterized complexity, the following win/win approach is commonly ex- 
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ploited. Given some computational problem, let us try to construct a good 
tree decomposition of the input graph. If we succeed, then we can use dy- 
namic programming to solve the problem efficiently. On the other hand, if 
we fail and the treewidth is large, then there is a reason for this outcome. 
This reason has the form of a combinatorial obstacle embedded in the graph 
that forbids us to decompose it expeditiously. However, the existence of such 
an obstacle can also be used algorithmically. For example, for some problems 
like VERTEX COVER or FEEDBACK VERTEX SET, we can immediately con- 
clude that we are dealing with a no-instance in case the treewidth is large. 
For other problems, like LONGEST PATH, large treewidth implies that we are 
working with a yes-instance. In more complicated cases, one can examine the 
structure of the obstacle in the hope of finding a so-called irrelevant vertex 
or edge, whose removal does not change the answer to the problem. Thus, 
regardless of whether the initial construction of a good tree decomposition 
succeeded or failed, we win: we solve the problem by dynamic programming, 
or we are able to immediately provide the answer, or we can simplify the 
problem and restart the algorithm. 

We start the chapter by slowly introducing treewidth and tree decom- 
positions, and simultaneously showing connections to the idea of dynamic 
programming on the structure of a graph. In Section [7.1] we build the intu- 
ition by explaining, on a working example of WEIGHTED INDEPENDENT SET, 
how dynamic-programming procedures can be designed on trees and on sub- 
graphs of grids. These examples bring us naturally to the definitions of path 
decompositions and pathwidth, and of tree decompositions and treewidth; 
these topics are discussed in Section In Section [7.3] we provide the full 
framework of dynamic programming on tree decompositions. We consider 
carefully three exemplary problems: WEIGHTED INDEPENDENT SET, DOM- 
INATING SET, and STEINER TREE; the examples of DOMINATING SET and 
STEINER TREE will be developed further in Chapter [11] 

Section [7.4] is devoted to connections between graphs of small treewidth 
and monadic second-order logic on graphs. In particular, we discuss a power- 
ful meta-theorem of Courcelle, which establishes the tractability of decision 
problems definable in Monadic Second-Order logic on graphs of bounded 
treewidth. Furthermore, we also give an extension of Courcelle’s theorem to 
optimization problems. 

In Section [7.5] we present a more combinatorial point of view on pathwidth 
and treewidth, by providing connections between these graph parameters, 
various search games on graphs, and classes of interval and chordal graphs. 
While these discussions are not directly relevant to the remaining part of this 
chapter, we think that they give an insight into the nature of pathwidth and 
treewidth that is invaluable when working with them. 

In Section|7.6]we address the question of how to compute a reasonably good 
tree decomposition of a graph. More precisely, we present an approximation 
algorithm that, given an n-vertex graph G and a parameter k, works in time 
O(8*k? - n?) and either constructs a tree decomposition of G of width at 
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most 4k + 4, or reports correctly that the treewidth of G is more than k. 
Application of this algorithm is usually the first step of the classic win/win 
framework described in the beginning of this section. 

We then move to using treewidth and tree decompositions as tools in 
more involved techniques. Section [7.7] is devoted to applications of treewidth 
for designing FPT algorithms on planar graphs. Most of these applications 
are based on deep structural results about obstacles to admitting good tree 
decompositions. More precisely, in Section we introduce the so-called 
Excluded Grid Theorem and some of its variants, which states that a graph 
of large treewidth contains a large grid as minor. In the case of planar graphs, 
the theorem gives a very tight relation between the treewidth and the size 
of the largest grid minor that can be found in a graph. This fact is then 
exploited in Section where we introduce the powerful framework of 
bidimensionality, using which one can derive parameterized algorithms on 
planar graphs with subexponential parametric dependence of the running 
time. In Section [7.7.3] we discuss the parameterized variant of the shifting 
technique; a reader familiar with basic results on approximation algorithms 
may have seen this method from a different angle. We apply the technique 
to give fixed-parameter tractable algorithms on planar graphs for SUBGRAPH 
ISOMORPHISM and MINIMUM BISECTION. 

In Section we give an FPT algorithm for a problem where a more 
advanced version of the treewidth win/win approach is implemented. More 
precisely, we provide an FPT algorithm for PLANAR VERTEX DELETION, the 
problem of deleting at most k vertices to obtain a planar graph. The crux of 
this algorithm is to design an irrelevant vertex rule: to prove that if a large 
grid minor can be found in the graph, one can identify a vertex that can be 
safely deleted without changing the answer to the problem. This technique is 
very powerful, but also requires attention to many technical details. For this 
reason, some technical steps of the correctness proof are omitted. 

The last part of this chapter, Section [7.9] gives an overview of other graph 
parameters related to treewidth, such as branchwidth and rankwidth. 


7.1 Trees, narrow grids, and dynamic programming 


Imagine that you want to have a party and invite some of your colleagues 
from work to come to your place. When preparing the list of invitations, you 
would like to maximize the total fun factor of the invited people. However, 
from experience you know that there is not much fun when your direct boss 
is also invited. As you want everybody at the party to have fun, you would 
rather avoid such a situation for any of the invited colleagues. 

We model this problem as follows. Assume that job relationships in your 
company are represented by a rooted tree T. Vertices of the tree represent 
your colleagues, and each v € V(T) is assigned a nonnegative weight w(v) 
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that represents the amount of contributed fun for a particular person. The 
task is to find the maximum weight of an independent set in T, that is, of a 
set of pairwise nonadjacent vertices. We shall call this problem WEIGHTED 
INDEPENDENT SET. 

This problem can be easily solved on trees by making use of dynamic 
programming. As usual, we solve a large number of subproblems that depend 
on each other. The answer to the problem shall be the value of a single, top- 
most subproblem. Assume that r is the root of the tree T (which corresponds 
to the superior of all the employees, probably the CEO). For a vertex v of 
T, let T, be the subtree of T rooted at v. For the vertex v we define the 
following two values: 


e Let Afv] be the maximum possible weight of an independent set in T,. 
e Let B[v] be the maximum possible weight of an independent set in T, that 
does not contain v. 


Clearly, the answer to the whole problem is the value of A[r]. 
Values of Afv] and B[v] for leaves of T are equal to w(v) and 0, respectively. 
For other vertices, the values are calculated in a bottom-up order. Assume 


that v1, ...,Uq are the children of v. Then we can use the following recursive 
formulas: a 
Blu] = $ Abe] 
i=1 
and 


Afv] = max { Bio], w(v) + > Biwi}. 


Intuitively, the correctness of these formulas can be explained as follows. We 
know that B[v] stores the maximum possible weight of an independent set 
in T, that does not contain v and, thus, the independent set we are seeking 
is contained in Ty,,...,Z,,- Furthermore, since T, is a tree, there is no edge 
between vertices of two distinct subtrees among Ty,,...,Ty,- This in turn 
implies that the maximum possible weight of an independent set of T, that 
does not contain v is the sum of maximum possible weights of an independent 
set of T,,, i € {1,...,q}. The formula for A[v] is justified by the fact that 
an independent set in T, of the maximum possible weight either contains v, 
which is taken care of by the term w(v) + $>}; B[vi], or does not contain v, 
which is taken care of by the term B[v]. Therefore, what remains to do is to 
calculate the values of A[v] and B[v] in a bottom-up manner in the tree T, and 
finally read the answer from A[r]. This procedure can be clearly implemented 
in linear time. Let us remark that with a slight modification of the algorithm, 
using a standard method of remembering the origin of computed values as 
backlinks, within the same running time one can find not only the maximum 
possible weight, but also the corresponding independent set. 


Let us now try to solve WEIGHTED INDEPENDENT SET on a different class 
of graphs. The problem with trees is that they are inherently “thin”, so let us 
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Fig. 7.1: A subgraph of a 4 x 8 grid, with the third and the fourth columns 
highlighted. When computing c[4,Y] for the forbidden set Y = {(2,4)} 
(crossed out in the fourth column), one of the sets S we consider is S = 
{(1, 4), (4,4)}, depicted with blue circles. Then, when looking at the previous 
column we need to forbid picking neighbors (1,3) and (4,3) (crossed out in 
the third column), since this would violate the independence constraint 


try to look at graphs that are “thicker” in nature, like grids. Since the class 
of grids is not very broad, let us rather focus on subgraphs of grids. More 
precisely, assume that we are given a graph G that is a subgraph of a k x N 
grid. The vertex set of a k x N grid consists of all pairs of the form (i, j) for 
1<i<kand1 <j < N, and two pairs (¢,71) and (é2,j2) are adjacent 
if and only if |¢; — i2| + |j1 — jo| = 1. Graph G is a subgraph of an k x N 
grid, which means that some vertices and edges of the grid can be missing 
in G. Figure [7.1] presents an example of a subgraph of a 4 x 8 grid. In our 
considerations, we will think of the number of rows k as quite small (say, 
k = 10), while N, the number of columns, can be very large (say, N = 10°). 
Of course, since we are trying to solve the WEIGHTED INDEPENDENT SET 
problem, every vertex v € V(G) is assigned its weight w(v). 

Let X; be the j-th column of G, that is, X; = V(G)N{(i,j) : 1<i<k}. 
Moreover, for 1 < j < N, let Gj = G[X, U X2 U... U Xj] be the graph 
induced by the first j columns. We would like to run a dynamic programming- 
algorithm that sweeps the grid from left to right, column by column. In 
the case of trees, we recognized two possible situations that were handled 
differently in the two dynamic-programming tables: either the root of the 
subtree was allowed to be picked into an independent set, or it was forbidden. 
Mimicking this idea, let us define the following function c[j, Y] that we shall 
compute in the algorithm. For Y C X}, we take the following definition: 


c|j, Y] = maximum possible weight of an independent set in G; — Y. 


In other words, we are examining graph Gj, and we look for the best possible 
independent set that avoids picking vertices from Y. We now move on to 
explaining how the values of c[j, Y] will be computed. 
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For j = 1, the situation is very simple. For every Y C Xj, we iterate 
through all possible subsets S C X,\ Y, and for each of them we check 
whether it is an independent set. Then c[1, Y] is the largest weight among 
the candidates (independent sets) that passed this test. Since for each Y C Xı 
we iterate through all possible subsets of X, \ Y, in total, for all the Ys, the 
number of checks is bounded by 


Xıl 


D Ga = 31X < 3k, (7.1) 


£=0 


Here, factor (=) comes from the choice of Y (the sum iterates over £ = |Y |), 
while factor 21X11- represents the number of choices for S C X; \ Y. Since 
every check can be implemented in KO“ time (assuming that for every vertex 
we store a list of its neighbors), the total time spent on computing values c[1, -] 
is BP ROOD, 

We now show how to compute the values of c[j, -] depending on the precom- 
puted values of c[j — 1,-], for j > 1. Let us look at one value c[j, Y] for some 
Y C Xj. Similarly, for j = 1, we should iterate through all the possible ways 
a maximum weight independent set intersects column X;. This intersection 
should be independent, of course, and moreover if we pick some v € X; \ Y 
to the independent set, then this choice forbids choosing its neighbor in the 
previous column X;—; (providing this neighbor exists). But for column X,;_1 
we have precomputed answers for all possible combinations of forbidden ver- 
tices. Hence, we can easily read from the precomputed values what is the best 
possible weight of an extension of the considered intersection with X; to the 
previous columns. All in all, we arrive at the following recursive formula: 


AY] = max, {w(S) +e -L.N(S)NX;a]fs 02 
S is independent 


here w(S) = J „eg w(v). Again, when applying for every Y C X; we 
iterate through all the subsets of X; \ Y. Therefore, as in (7.1) we obtain that 
the total number of sets S checked, for all the sets Y, is at most 3%. Each S 
is processed in kO“) time, so the total time spent on computing values c[j, -] 
gash, 

To wrap up, we first compute the values c[1,-], then iteratively compute 
values c[j,-] for j € {2,3,..., N} using (7.2), and conclude by observing that 
the answer to the problem is equal to c[N, 0]. As argued, each iteration takes 
time 3°. kO), so the whole algorithm runs in time 3°. kO . N. 


Let us now step back and look at the second algorithm that we designed. 
Basically, the only property of G that we really used is that V(G) can be 
partitioned into a sequence of small subsets (columns, in our case), such that 
edges of G can connect only two consecutive subsets. In other words, G has 
a “linear structure of separators”, such that each separator separates the part 
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lying on the left of it from the part on the right. In the algorithm we actually 
used the fact that the columns are disjoint, but this was not that crucial. The 
main point was that only two consecutive columns can interact. 

Let us take a closer look at the choice of the definition of the table c[j, Y]. 
Let I be an independent set in a graph G that is a subgraph of a k x N grid. 
For fixed column number j, we can look at the index Y in the cell clj, Y] 
as the succinct representation of the interaction between 7 N U?_, Xa and 
In a Xa- More precisely, assume that Y = X; N N(I N Xj41) and, 
consequently, the size of C = I N U2_, Xa is one of the candidates for the 
value cj, Y]. Furthermore, let C” be any other candidate for the value c[j, Y], 
that is, let C” be any independent set in G; \Y. Observe that then (I\C)UC’ 
is also an independent set in G. In other words, when going from the column 
j to the column j + 1, the only information we need to remember is the set 
Y of vertices “reserved as potential neighbors”, and, besides the set Y, we do 
not care how exactly the solution looked so far. 

If we now try to lift these ideas to general graphs, then the obvious thing to 
do is to try to merge the algorithms for trees and for subgraphs of grids into 
one. As kx N grids are just “fat” paths, we should define the notion of trees of 
“fatness” k. This is exactly the idea behind the notion of treewidth. In the next 
section we shall first define parameter pathwidth, which encapsulates in a more 
general manner the concepts that we used for the algorithm on subgraphs of 
grids. From pathwidth there will be just one step to the definition of treewidth, 
which also encompasses the case of trees, and which is the main parameter 
we shall be working with in this chapter. 


7.2 Path and tree decompositions 


We have gathered already enough intuition so that we are ready to introduce 
formally the main notions of this chapter, namely path and tree decomposi- 
tions. 


Path decompositions. A path decomposition of a graph G is a sequence 
P = (Xi, X2,..., Xr) of bags, where X; C V(G) for each i € {1,2,...,r}, 
such that the following conditions hold: 


(P1) Us_, Xi = V(G). In other words, every vertex of G is in at least one 
bag. 

(P2) For every uv € E(G), there exists £ € {1,2,...,r} such that the bag 
Xe contains both u and v. 

(P3) For every u € V(G), if u € X; N Xp for some i < k, then u € X; also 
for each j such that i < j < k. In other words, the indices of the bags 
containing u form an interval in {1,2,...,r}. 
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Fig. 7.2: A graph, its path and nice path decompositions 


See Fig. [7.2] for example of a path decomposition. The width of a path de- 
composition (X1, Xo,...,X,) is maXı<i<r |X:|— 1. The pathwidth of a graph 
G, denoted by pw(G), is the minimum possible width of a path decomposi- 
tion of G. The reason for subtracting 1 in the definition of the width of the 
path decomposition is to ensure that the pathwidth of a path with at least 
one edge is 1, not 2. Similarly, we subtract in the definition of treewidth to 
ensure that the treewidth of a tree is 1. 

We can also interpret the bags of a path decomposition as nodes of a path 
and two consecutive bags correspond to two adjacent nodes of the path. This 
interpretation will become handy when we introduce tree decomposition. 

For us the most crucial property of path decompositions is that they define 
a sequence of separators in the graph. In the following, we will say that (A, B) 
is a separation of a graph G if AU B = V(G) and there is no edge between 
A\ Band B\ A. Then AN B is a separator of this separation, and |AN B| 
is the order of the separation. Note that any path in G that begins in A 
and ends in B must contain at least one vertex of the separator A N B. 
Also, for a subset A C V(G) we define the border of A, denoted by (A), as 
the set of those vertices of A that have a neighbor in V(G) \ A. Note that 
(A, (V(G) \ A) U O(A)) is a separation with separator 0(A). Let us remark 
that by Lemma [7.1] each of the bags X; separates vertices in the bags before 
i with the vertices of the bags following after i. 


Lemma 7.1. Let (Xı, X2,..., Xr) be a path decomposition of a graph G. 
Then for every j € {1,...,r — 1} it holds that 0( pa Xi) C X; A Xj- 
In other words, (UZ Xin Uiz Xi) is a separation of G with separator 
X; N Xj41- 
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i=l 
aU Xi) = O(A) C X; N Xj41. Targeting a contradiction, let us assume 
that there is a u € O(A) such that u ¢ Xj NX;j+1. This means that there is an 
edge uv € E(G) such that u € A, v ¢ A but also u ¢ X;MXj41. Let i be the 
largest index such that u € X; and k be the smallest index such that v € Xz. 
Since u € A and u ¢ Xj N Xj+1, (P3) implies that i < j. Since v ¢ A, we 
have also k > j+1. Therefore i < k. On the other hand, by (P2) there should 
be a bag Xe containing both u and v. We obtain that £ < i < k < £, which is 
a contradiction. The fact that AN B = X; N Xj+1 follows immediately from 


(P3). 


Proof. Let us fix j and let (A, B) = (UÍ Xi, Uj=j41 Xi). We first show that 














Note that we can always assume that no two consecutive bags of a 
path decomposition are equal, since removing one of such bags does not 
violate any property of a path decomposition. Thus, a path decomposi- 
tion (X1, X2,..., Xp) of width p naturally defines a sequence of separations 
(Uj21 Xi, Ui=541 Xi). Each of these separations has order at most p, because 
the intersection of two different sets of size at most p + 1 has size at most p. 

We now introduce sort of a “canonical” form of a path decomposition, 
which will be useful in the dynamic-programming algorithms presented in 
later sections. A path decomposition P = (X1, X2,...,X,) of a graph G is 
nice if 
e X, =X, =h, and 
e for every i € {1,2,...,r — 1} there is either a vertex v ¢ X; such that 

Xi41 = X; U {v}, or there is a vertex w € X; such that X;41 = X; \ {w}. 


See Fig. [7.2|for example. 
Bags of the form X;4, = X; U {v} shall be called introduce bags (or in- 


troduce nodes, if we view the path decomposition as a path rather than a 
sequence). Similarly, bags of the form X;,, = X; \ {w} shall be called forget 
bags (forget nodes). We will also say that X;41 introduces v or forgets w. Let 
us note that because of (P3), every vertex of G gets introduced and becomes 
forgotten exactly once in a nice path decomposition, and hence we have that 
r, the total number of bags, is exactly equal to 2|V(G)|+ 1. It turns out that 
every path decomposition can be turned into a nice path decomposition of 
at most the same width. 


Lemma 7.2. If a graph G admits a path decomposition of width at most p, 
then it also admits a nice path decomposition of width at most p. Moreover, 
given a path decomposition P = (X1, X2,..., Xr) of G of width at most p, 
one can in time O(p?-max(r,|V(G)|)) compute a nice path decomposition of 
G of width at most p. 


The reader is asked to prove Lemma [7.2] in Exercise [7.1] 


Tree decompositions. A tree decomposition is a generalization of a path 
decomposition. Formally, a tree decomposition of a graph G is a pair T = 
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(T,{Xt}revr)), where T is a tree whose every node t is assigned a vertex 
subset X, C V(G), called a bag, such that the following three conditions 
hold: 


(T1) Utevery) Xt = V(G). In other words, every vertex of G is in at least 
one bag. 

(T2) For every wv € E(G), there exists a node t of T such that bag X; 
contains both u and v. 

(T3) For every u € V(G), the set T, = {t € V(T) : u € X+}, i.e., the set 
of nodes whose corresponding bags contain u, induces a connected subtree 
of T. 


The width of tree decomposition T = (T, {Xz }revr)) equals maxyey 7) | X+|— 
1, that is, the maximum size of its bag minus 1. The treewidth of a graph G, 
denoted by tw(G), is the minimum possible width of a tree decomposition of 
G. 

To distinguish between the vertices of the decomposition tree T and the 
vertices of the graph G, we will refer to the vertices of T as nodes. As we 
mentioned before, path decompositions correspond exactly to tree decompo- 
sitions with the additional requirement that T has to be a path. 

Let us give several examples of how small or large can be the treewidth 
of particular graph classes. Forests and trees are of treewidth at most 1 and 
cycles have treewidth 2, see Exercise The treewidth of an outerplanar 
graph, which is a graph that can be drawn in the plane in such manner that 
all its vertices are on one face, is at most 2, see Exercise [7.12] On the other 
hand, the treewidth of planar graphs can be arbitrarily large. For example, 
as we will see later, the treewidth of a t x t grid is t. Interestingly, for every 
planar graph H, there is a constant cy, such that for every graph G excluding 
H as a minor, the treewidth of G does not exceed cy, see Exercise [7.36] 

While planar graphs can have arbitrarily large treewidths, as we will see 
later, still the treewidth of an n-vertex planar graph is sublinear, more pre- 
cisely O(,/n). The treewidth of an n-vertex clique Kn is n — 1 and of a 
complete bipartite graph Kn,m is min{m, n} — 1. Expander graphs serve as 
an example of a sparse graph class with treewidth R(n), see Exercise [7.34] 


In Lemma [7.1] we have proved that for every pair of adjacent nodes of a 
path decomposition, the intersection of the corresponding bags is a separator 
that separates the left part of the decomposition from the right part. The 
following lemma establishes a similar separation property of bags of a tree 
decomposition. Its proof is similar to the proof of Lemma [7.1] and is left to 
the reader as Exercise [7.5] 


Lemma 7.3. Let (T,{Xi}iev(r)) be a tree decomposition of a graph G and 
let ab be an edge of T. The forest T — ab obtained from T by deleting edge ab 
consists of two connected components T, (containing a) and T, (containing 
b). Let A= sever, Xt and B = Uievin) Xt- Then O(A), 0(B) E Xa N Xo. 
Equivalently, (A, B) is a separation of G with separator Xa N Xp. 
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Again, note that we can always assume that the bags corresponding to two 
adjacent nodes in a tree decomposition are not the same, since in such sit- 
uation we could contract the edge between them, keeping the same bag in 
the node resulting from the contraction. Thus, if (T, {X+}revcr)) has width 
t, then each of the separations given by Lemma [7.3] has order at most t. 

Similarly to nice path decompositions, we can also define nice tree decom- 
positions of graphs. It will be convenient to think of nice tree decompositions 
as rooted trees. That is, for a tree decomposition (T, {X:}rev(r)) we distin- 
guish one vertex r of T which will be the root of T. This introduces natural 
parent-child and ancestor-descendant relations in the tree T. We will say 
that such a rooted tree decomposition (T, {X:}rev(r)) is nice if the following 
conditions are satisfied: 


e X, = and X; = Ú for every leaf £ of T. In other words, all the leaves as 
well as the root contain empty bags. 
e Every non-leaf node of T is of one of the following three types: 


— Introduce node: a node t with exactly one child t such that X, = 
Xy U {v} for some vertex v ¢ Xy; we say that v is introduced at t. 

— Forget node: a node t with exactly one child t such that X; = Xy \{w} 
for some vertex w € Xy; we say that w is forgotten at t. 

— Join node: a node t with two children t1, t2 such that X; = Xan = Xa- 


At first glance the condition that the root and the leaves contain empty 
bags might seem unnatural. As we will see later, this property helps to 
streamline designing dynamic-programming algorithms on tree decomposi- 
tions, which is the primary motivation for introducing nice tree decomposi- 
tions. Note also that, by property (T3) of a tree decomposition, every vertex 
of V(G) is forgotten only once, but may be introduced several times. 

Also, note that we have assumed that the bags at a join node are equal 
to the bags of the children, sacrificing the previous observation that any 
separation induced by an edge of the tree T is of order at most t. The increase 
of the size of the separators from t to t + 1 has negligible effect on the 
asymptotic running times of the algorithms, while nice tree decompositions 
turn out to be very convenient for describing the details of the algorithms. 

The following result is an analogue of Lemma [7.2|for nice tree decomposi- 
tions. The reader is asked to prove it in Exercise [7.2] 


Lemma 7.4. If a graph G admits a tree decomposition of width at most k, 
then it also admits a nice tree decomposition of width at most k. Moreover, 
given a tree decomposition T = (T,{Xt}tev(r)) of G of width at most k, one 
can in time O(k? - max(|V(T)|,|V(G)|)) compute a nice tree decomposition 
of G of width at most k that has at most O(k|V(G)|) nodes. 


Due to Lemma we will assume that all the nice tree decompositions 
used by our algorithms have O(k|V(G)|) nodes. 
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Note that in a general setting a good path/tree decomposition of an in- 
put graph is not known in advance. Hence, in the dynamic-programming 
algorithms we will always assume that such a decomposition is provided on 
the input together with the graph. For this reason, we will need to address 
separately how to compute its path/tree decomposition of optimum or near- 
optimum width, so that an efficient dynamic-programming algorithm can be 
employed on it. In this book we shall not answer this question for path de- 
compositions and pathwidth. However, in Section [7.6] we present algorithms 
for computing tree decompositions of (approximately) optimum width. 


7.3 Dynamic programming on graphs of bounded 
treewidth 


In this section we give examples of dynamic-programming-based algorithms 
on graphs of bounded treewidth. 


7.38.1 WEIGHTED INDEPENDENT SET 


In Section|7.1}we gave two dynamic-programming routines for the WEIGHTED 
INDEPENDENT SET problem. The first of them worked on trees, and the 
second of them worked on subgraphs of grids. Essentially, in both cases the 
main idea was to define subproblems for parts of graphs separated by small 
separators. In the case of trees, every vertex of a tree separates the subtree 
rooted at it from the rest of the graph. Thus, choices made by the solution 
in the subtree are independent of what happens outside it. The algorithm 
for subgraphs of grids exploited the same principle: every column of the grid 
separates the part of the graph on the left of it from the part on the right. 

It seems natural to combine these ideas for designing algorithms for graphs 
of bounded treewidth. Let us focus on our running example of the WEIGHTED 
INDEPENDENT SET problem, and let T = (T, {Xi}tevir)) be a tree decom- 
position of the input n-vertex graph G that has width at most k. By applying 
Lemma [7.4] we can assume that 7 is a nice tree decomposition. Recall that 
then T is rooted at some node r. For a node t of T, let V; be the union of all 
the bags present in the subtree of T rooted at t, including X+. Provided that 
t Ær we can apply Lemma [7.3] to the edge of T between t and its parent, 
and infer that 0(Vi) C X;. The same conclusion is trivial when t = r, since 
then V, = V(G) and O(V,) = Ø. This exactly formalizes the intuition that 
the subgraph induced by V; can communicate with the rest of the graph only 
via bag X+, which is of small size. 

Extending our intuition from the algorithms of Section [7.1] we would like 
to define subproblems depending on the interaction between the solution and 
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bag X+. Consider the following: Let I1, I2 be two independent sets of G such 
that OX: = IgM X;. Let us add the weights of vertices of J4 and J, that are 
contained in V;, and suppose that it turned out that w(1OV;) > w(IanV;). 
Observe that then solution I> is suboptimal for the following reason. We can 
obtain a solution I} from Ig by replacing I2 N V; with Jı N V;. The fact that 
X; separates V; \ X; from the rest of the graph, and that I, and I> have the 
same intersection with X+, implies that I} is still an independent set. On the 
other hand, w(, N V;) > w(I2 A V;) implies that w(I5) > w(I2). 

Therefore, among independent sets J satisfying I N X; = S for some fixed 
S, all the maximum-weight solutions have exactly the same weight of the part 
contained in V;. This weight corresponds to the maximum possible value for 
the following subproblem: given S C X+, we look for a maximum-weight ex- 
tension § > S such that S CV, Sn X, = S, and S is independent. Indeed, 
the argument from the previous paragraph shows that for any solution J with 
IN X; = S, the part of the solution contained in V; can be safely replaced 
with the best possible partial solution S — this replacement preserves in- 
dependence and can only increase the weight. Observe that the number of 
subproblems is small: for every node t, we have only 2|*+! subproblems. Also, 
we do not need to remember S explicitly; remembering its weight will suffice. 

Hence, we now mimic the bottom-up dynamic programming that we per- 
formed for trees. For every node t and every S C X+, define the following 
value: 


c{t, S] = maximum possible weight of a set 9 such that 
S c SCV, SAX, = S, and S is independent. 


If no such set 9 exists, then we put c[t, S] = —co; note that this happens if 
and only if S is not independent itself. Also c[r,@] is exactly the maximum 
weight of an independent set in G; this is due to the fact that V, = V(G) 
and X, = Í. 

The reader can now observe that this definition of function c[-,-] differs 
from the one used in Section While there we were just forbidding ver- 
tices from some set Y from being used, now we fix exactly how a solution 
is supposed to interact with a bag. Usually, fixing the exact interaction of 
the solution with a bag is a more generic approach to designing dynamic- 
programming algorithms on tree decompositions. However, it must be ad- 
mitted that tweaking the definition slightly (e.g., by relaxing the exactness 
condition to allow or forbid usage of a subset of vertices) often leads to a sim- 
pler description. This is actually the case also in the example of WEIGHTED 
INDEPENDENT SET. In Exercise [7.17|the reader is asked to work out the de- 
tails of a dynamic program with the definition of a state mimicking the one 
used in Section 

We now move on to presenting how the values of c[-,-] are computed. 
Thanks to the definition of a nice tree decomposition, there are only a few 
simple ways in which a bag at some node can relate to the bags of the children 
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of this node. Therefore, if we are fortunate we can compute the values at each 
node t based on the values computed for the children of t. This will be done 
by giving recursive formulas for c[t, S]. The case when t is a leaf corresponds 
to the base case of the recurrence, whereas values for c|-, -] for a non-leaf node 
t depend on the values of c|, -] for the children of t. By applying the formulas 
in a bottom-up manner on T we will finally compute c[r, Ø], which is the value 
we are looking for. 


Leaf node. If t is a leaf node, then we have only one value cft, Ø] = 0. 


Introduce node. Suppose tis an introduce node with child t such that X, = 
Xv U{v} for some v ¢ Xy. Let S be any subset of X+. If S is not independent, 


then we can immediately put c[t,S] = —oo; hence assume otherwise. Then 
we claim that the following formula holds: 
U,S if S; 
aasi A (7.3) 
clt, S\{v}]+w(v) otherwise. 


To prove formally that this formula holds, consider first the case when 
v ¢ S. Then the families of sets S considered in the definitions of c[t, 5] and 
of c[t’, S] are equal, which immediately implies that c[t, S] = c[t’, S]. 

Consider the case when v € S, and let S be a set for which the maximum 
is attained in the definition of cļ|t, S]. Then it follows that 9 \ {v} is one 
of the sets considered in the definition of c[t’,S \ {v}], which implies that 
clt, S \ {v}] > w(S \ {v}) = w(S) — w) = c{t, S] — w(v). Consequently, 


clt, S] < eft’, S \ {v} + w(v). (7.4) 


On the other hand, let S$’ be a set for which the maximum is attained in 
the definition of c[t’,S \ {v}]. Since we assumed that S is independent, we 
have that v does not have any neighbor in S \ {v} = S A Xy. Moreover, by 
Lemma v does not have any neighbors in Vy \ Xv, which is a superset of 
gs \ Xv. We conclude that v does not have any neighbors in S', which means 
that S'U {v} is an independent set. Since this set intersects with X; exactly 
at S, it is considered in the definition of c[t,.S] and we have that 


clt, S] > w(S" U {v} = w(S’) + wlw) = eft’, S \ {v} + w(v). (7.5) 
Concluding, (7.4) and (7.5) together prove (7.3) for the case v € S. 


Forget node. Suppose t is a forget node with child t such that X; = Xv Yw} 
for some w € Xy. Let S be any subset of X;; again we assume that S 
is independent, since otherwise we put c[t,S] = —oo. We claim that the 
following formula holds: 


ct, S] = max {elt S], elt’, SU {w}]}, (7.6) 
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We now give a formal proof of this formula. Let S be a set for which the 
maximum is attained in the definition of c[t, S]. If w ¢ S, then S is one of the 


sets considered in the definition of c[t’, S], and hence c[t’, S] > w(S) = cft, S]. 
However, if w € S then S is one of the sets considered in the definition of 


A 


cit’, SU{w}], and then c[t’, SU{w}] > w(S) = cft, S]. As exactly one of these 
alternatives happens, we can infer that the following holds always: 


eft, S] < max felt’, S], c, SU {w}]}, (7.7) 


On the other hand, observe that each set that is considered in the definition 
of c[t’, S] is also considered in the definition of c[t, S], and the same holds also 
for c[t’, SU {w}]. This means that c[t, S] > c[t’, S] and cft, S] > ct, SU{w}]. 
These two inequalities prove that 


clt, S] > max {elt S], elt’, SU {w}] k (7.8) 


The combination of (7.7) and (7.8) proves that formula (7.6) indeed holds. 


Join node. Finally, suppose that t is a join node with children t,,t2 such 
that X; = Xa = Xn. Let S be any subset of X+; as before, we can assume 
that S is independent. The claimed recursive formula is as follows: 


cft, S] = elti, S] + clt, S] — w(S). (7.9) 


We now prove formally that this formula holds. First take S to be a set for 
which the maximum is attained in the definition of c[t, S]. Let S1 = S N Va 
and RA =§n V- Observe that Sı is independent and Sı N Xa = S, so this 
set is considered in the definition of c{t,, S]. Consequently c{t,,.$] > w(S1), 
and analogously c[tz, S] > w(S2). Since Sı N Sz = S, we obtain the following: 


clt, S] = w(S) = w(i) + w(S2) — w(S) < celti, S] + c[t2, S] — w(S). (7.10) 


On the other hand, let St be a set for which the maximum is attained in the 
definition of clt, S], and similarly define St, for c[t2, 5]. By Lemma [7.3] we 
have that there is no edge between vertices of V} \ X: and Vi, \ X+, which 
implies that the set J := St U St is independent. Moreover FINAX: = S, 
which implies that S’ is one of the sets considered in the definition of cft, S]. 
Consequently, 


clt, S] > w(S") = w(S) + w(S%) — w(S) = elt, S] + clt2, 5] — w(S). (7.11) 


From (7.10) and (7.11) we infer that (7.9) indeed holds. 


This concludes the description and the proof of correctness of the recursive 
formulas for computing the values of c[-,-]. Let us now wrap up the whole 
algorithm and estimate the running time. Recall that we are working on a tree 
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decomposition of width at most k, which means that |X;| < k + 1 for every 
node t. Thus at node t we compute 2!**| < 2¥+1 values of c[t, S]. However, 
we have to be careful when estimating the time needed for computing each 
of these values. 

Of course, we could just say that using the recursive formulas and any 
graph representation we can compute each value cft, S] in nC“ time, but 
then we would end up with an algorithm running in time 2° -n°™. It is easy 
to see that all the operations needed to compute one value can be performed 
in kO time, apart from checking adjacency of a pair of vertices. We need it, 
for example, to verify that a set S is independent. However, a straightforward 
implementation of an adjacency check runs in O(n) time, which would add an 
additional O(n) factor to the running time of the algorithm. Nonetheless, as 
G is a graph of treewidth at most k, it is possible to construct a data structure 
in time kn that allows performing adjacency queries in time O(k). The 
reader is asked to construct such a data structure in Exercise [7.16 

Wrapping up, for every node t it takes time 2* -k°) to compute all the 
values c[t, S]. Since we can assume that the number of nodes of the given 
tree decompositions is O(kn) (see Lemma [7.4}, the total running time of the 
algorithm is 2% -k°) . n. Hence, we obtain the following theorem. 


Theorem 7.5. Let G be an n-verter graph with weights on vertices given 
together with its tree decomposition of width at most k. Then the WEIGHTED 
INDEPENDENT SET problem in G is solvable in time 2" »kO . n. 


Again, using the standard technique of backlinks, i.e., memorizing for every 
cell of table c[-,-] how its value was obtained, we can reconstruct the solution 
(i.e., an independent set with the maximum possible weight) within the same 
asymptotic running time. The same will hold also for all the other dynamic- 
programming algorithms given in this section. 

Since a graph has a vertex cover of size at most £ if and only if it has an 
independent set of size at least n — £, we immediately obtain the following 
corollary. 


Corollary 7.6. Let G be an n-vertex graph given together with its tree de- 
composition of width at most k. Then one can solve the VERTEX COVER 
problem in G in time 2° -kOM . n. 


The algorithm of Theorem [7.5] seems actually quite simple, so it is very 
natural to ask if its running time could be improved. We will see in Chap- 
ter Theorem that under some reasonable assumptions the upper 
bound of E tight. 

The reader might wonder now why we gave all the formal details of this 
dynamic-programming algorithm, even though most of them were straightfor- 
ward. Our point is that despite the naturalness of many dynamic-programming 
algorithms on tree decompositions, proving their correctness formally needs 
a lot of attention and care with regard to the details. We tried to show which 
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implications need to be given in order to obtain a complete and formal proof, 
and what kind of arguments can be used along the way. 


Most often, proving correctness boils down to showing two inequalities 
for each type of node: one relating an optimum solution for the node 
to some solutions for its children, and the second showing the reverse 
correspondence. It is usual that a precise definition of a state of the 
dynamic program together with function c denoting its value already 
suggests natural recursive formulas for c. Proving correctness of these 
formulas is usually a straightforward and tedious task, even though it 
can be technically challenging. 


For this reason, in the next dynamic-programming routines we usually only 
give a precise definition of a state, function c, and the recursive formulas for 
computing c. We resort to presenting a short rationale for why the formulas 
are correct, leaving the full double-implication proof to the reader. We suggest 
that the reader always performs such double-implication proofs for his or her 
dynamic-programming routines, even though all the formal details might not 
always appear in the final write-up of the algorithm. Performing such formal 
proofs can highlight the key arguments needed in the correctness proof, and 
is the best way of uncovering possible problems and mistakes. 

Another issue that could be raised is why we should not go further and 
also estimate the polynomial factor depending on k in the running time of 
the algorithm, which is now stated just as KO“). We refrain from this, since 
the actual value of this factor depends on the following: 


e How fast we can implement all the low-level details of computing formu- 
las for c[t, S], e.g., iteration through subsets of vertices of a bag. This in 
particular depends on how we organize the structure of G and 7 in the 
memory. 

e How fast we can access the exponential-size memory needed for storing 
the dynamic-programming table c[-,-]. 


Answers to these questions depend on low-level details of the implementation 
and on the precise definition of the assumed model of RAM computations. 
While optimization of the KC” factor is an important and a nontrivial prob- 
lem in practice, this issue is beyond the theoretical scope of this book. For 
this reason, we adopt a pragmatic policy of not stating such polynomial fac- 
tors explicitly for dynamic-programming routines. In the bibliographic notes 
we provide some further references to the literature that considers this issue. 
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7.38.2 DOMINATING SET 


Our next example is the DOMINATING SET problem. Recall that a set of 
vertices D is a dominating set in graph G if V(G) = N[D]. The goal is 
to provide a dynamic-programming algorithm on a tree decomposition that 
determines the minimum possible size of a dominating set of the input graph 
G. Again, n will denote the number of vertices of the input graph G, and k is 
an upper bound on the width of the given tree decomposition (T, {X¢}tevr)) 
of G. 

In this algorithm we will use a refined variant of nice tree decompositions. 
In the algorithm of the previous section, for every node t we were essentially 
interested in the best partial solutions in the graph G[V;]. Thus, whenever a 
vertex v was introduced in some node t, we simultaneously introduced also 
all the edges connecting it to other vertices of X+. We will now add a new 
type of a node called an introduce edge node. This modification enables us to 
add edges one by one, which often helps in simplifying the description of the 
algorithm. 

Formally, in this extended version of a nice tree decomposition we have 
both introduce vertex nodes and introduce edge nodes. Introduce vertex 
nodes correspond to introduce nodes in the standard sense, while introduce 
edge nodes are defined as follows. 


e Introduce edge node: a node t, labeled with an edge uv € E(G) such 
that u,v € X,, and with exactly one child t such that X, = Xy. We say 
that edge wv is introduced at t. 


We additionally require that every edge of E(G) is introduced exactly 
once in the whole decomposition. Leaf nodes, forget nodes and join nodes are 
defined just as previously. Given a standard nice tree decomposition, it can 
be easily transformed to this variant as follows. Observe that condition (T3) 
implies that, in a nice tree decomposition, for every vertex v € V(G), there 
exists a unique highest node ¢(v) such that v € Xi»); moreover, the parent 
of X4y) is a forget node that forgets v. Consider an edge uv € E(G), and 
observe that (T2) implies that t(v) is an ancestor of t(u) or t(u) is an ancestor 
of t(v). Without loss of generality assume the former, and observe that we 
may insert the introduce edge bag that introduces uv between t(u) and its 
parent (which forgets u). This transformation, for every edge uv € E(G), 
can be easily implemented in time kOn by a single top-down transversal of 
the tree decomposition. Moreover, the obtained tree decomposition still has 
O(kn) nodes, since a graph of treewidth at most k has at most kn edges (see 
Exercise [7.15}. 

With each node t of the tree decomposition we associate a subgraph G+ of 
G defined as follows: 


G; = (Va, Er = {e : eis introduced in the subtree rooted at t}). 
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While in the previous section the subproblems for t were defined on the graph 
G[V,], now we will define subproblems for the graph G4. 

We are ready to define the subproblems formally. For WEIGHTED IN- 
DEPENDENT SET, we were computing partial solutions according to how a 
maximum weight independent set intersects a bag of the tree decomposition. 
For domination the situation is more complicated. Here we have to distin- 
guish not only if a vertex is in the dominating set or not, but also if it is 
dominated. A coloring of bag X, is a mapping f: X, — {0,0,1} assigning 
three different colors to vertices of the bag. 


e Black, represented by 1. The meaning is that all black vertices have to be 
contained in the partial solution in Gz. 

e White, represented by 0. The meaning is that all white vertices are not 
contained in the partial solution and must be dominated by it. 

e Grey, represented by 0. The meaning is that all grey vertices are not 
contained in the partial solution, but do not have to be dominated by it. 


The reason why we need to distinguish between white and grey vertices is 
that some vertices of a bag can be dominated by vertices or via edges which 
are not introduced so far. Therefore, we also need to consider subproblems 
where some vertices of the bag are not required to be dominated, since such 
subproblems can be essential for constructing the optimum solution. Let us 
stress the fact that we do not forbid grey vertices to be dominated — we just 
do not care whether they are dominated or not. 

For a node t, there are 3!*¢! colorings of X; these colorings form the space 
of states at node t. For a coloring f of X+, we denote by c[t, f] the minimum 
size of a set D C V; such that 


e DNX; = f71(1), which is the set of vertices of X, colored black. 

e Every vertex of V, \ f~1(0) either is in D or is adjacent in G; to a vertex 
of D. That is, D dominates all vertices of V; in graph G;, except possibly 
some grey vertices in Xz. 


We call such a set D a minimum compatible set for t and f. If no minimum 
compatible set for t and f exists, we put c[t, f] = +oo. Note that the size of 
a minimum dominating set in G is exactly the value of c[r, 0] where r is the 
root of the tree decomposition. This is because we have G = G, and X, = Í, 
which means that for X, we have only one coloring: the empty function. 

It will be convenient to use the following notation. For a subset X C V(G), 
consider a coloring f : X — {0,0,1}. For a vertex v € V(G) and a color 
a € {0,0,1} we define a new coloring fosa : X U {v} —> {0,0, 1} as follows: 


f(x) when z £v, 


Qa when x = v. 


fosa(x) = 


For a coloring f of X and Y C X, we use fly to denote the restriction of f 
to Y. 
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We now proceed to present the recursive formulas for the values of c. As 
we mentioned in the previous section, we give only the formulas and short 
arguments for their correctness, leaving the full proof to the reader. 


Leaf node. For a leaf node t we have that X, = Ø. Hence there is only one, 
empty coloring, and we have cft, Ø] = 0. 


Introduce vertex node. Let t be an introduce node with a child t” such 
that X; = Xv U {v} for some v ¢ Xy. Since this node does not introduce 
edges to Gz, the computation will be very simple — v is isolated in G. Hence, 
we just need to be sure that we do not introduce an isolated white vertex, 
since then we have c[t, f] = +oo. That is, for every coloring f of X, we can 
put 


+00 when f(v) = 0, 
elt, f] = ¢ ct, flx] when f(v) = 0, 
1+ eft’, f|x,,] when f(v) =1. 


Introduce edge node. Let t be an introduce edge node labeled with an 
edge uv and let t’ be the child of t. Let f be a coloring of X;. Then sets D 
compatible for t and f should be almost exactly the sets that are compatible 
for t and f, apart from the fact that the edge uv can additionally help in 
domination. That is, if f colors u black and v white, then when taking the 
precomputed solution for t’ we can relax the color of v from white to grey 
— in the solution for t’ we do not need to require any domination constraint 
on v, since v will get dominated by u anyways. The same conclusion can be 
drawn when u is colored white and v is colored black. Therefore, we have the 
following formulas: 


W, fy 46] when (F(u), 
W, fuse) when (F(u), 
ct, f otherwise. 


(v)) = (1,0), 
(v)) = (0,1), 


c f 
elt, f] = ye f 


Forget node. Let t be a forget node with a child t’ such that X; = Xy \ {w} 
for some w € Xy. Note that the definition of compatible sets for t and f 
requires that vertex w be dominated, so every set D compatible for t and f 
is also compatible for t’ and fy+1 (if w € D) or fuso (if w ¢ D). On the 
other hand, every set compatible for t’ and any of these two colorings is also 
compatible for t and f. This justifies the following recursive formula: 


clt, f] = min {elt foi}, elt’, ful}. 


Join node. Let t be a join node with children tı and tə. Recall that X, = 
Xa = Xn. We say that colorings fı of X} and f2 of X+, are consistent with 
a coloring f of X, if for every v € X; the following conditions hold 
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(i) f(v) = 1 if and only if fi(v) = fo(v) =1,. . 

(ii) Flv) = 0 if and only if (fi (v), falv)) € {(0,0),(0,0)}, 
(iii) f(v) = 0 if and only if fı (v) = f2(v) = 0. 
On one hand, if D is a compatible set for f and t, then Dı := D N V, and 
Dz := D AN Va, are compatible sets for tı and fı, and tg and fo, for some 
colorings f1, fo that are consistent with f. Namely, for every vertex v that is 
white in f we make it white either in fı or in f2, depending on whether it is 
dominated by Dı in G;, or by D2 in G+, (if v is dominated by both Dı and 
Dz, then both options are correct). On the other hand, if Dı is compatible for 
tı and fı and Də is compatible for tz and f2, for some colorings fı, fz that are 
consistent with f, then it is easy to see that D := Dı U D2 is compatible for t 
and f. Since for such D1, Da we have that DN X; = D1 A Xan = D2N Xi, = 
f~*(1), it follows that |D| = |D,| + |D2| — |f~+(1)|. Consequently, we can 
infer the following recursive formula: 


clt, f] = min { elti, fal + eft, fal — LF I}, (7.12) 


where the minimum is taken over all colorings fı, f2 consistent with f. 


This finishes the description of the recursive formulas for the values of c. 
Let us analyze the running time of the algorithm. Clearly, the time needed 
to process each leaf node, introduce vertex/edge node or forget node is 3° - 
kO), providing that we again use the data structure for adjacency queries 
of Exercise [7.16] However, computing the values of c in a join node is more 
time consuming. The computation can be implemented as follows. Note that 
if a pair f1, f2 is consistent with f, then for every v € X+ we have 


(f(v), fiw), fa(v)) € {(1, 1, 1), (0, 0,0), (0, 0, 0), (0, 0, 0)}. 


It follows that there are exactly 4!*+! triples of colorings (f, fa, f2) such that 
fi and fz are consistent with f, since for every vertex v we have four possibil- 
ities for (f(v), fi(v), fo(v)). We iterate through all these triples, and for each 
triple (f, fı, f2) we include the contribution from fı, f2 to the value of c[t, f] 
according to (7.12). In other words, we first put c[t, f] = +20 for all colorings 
f of X+, and then for every considered triple (f, fı, fo) we replace the current 
value of c[t, f] with c[t1, fi] + c[te, fo] — |f~1(1)| in case the latter value is 
smaller. As |X;| < k +1, it follows that the algorithm spends 4* -kO) time 
for every join node. Since we assume that the number of nodes in a nice tree 
decomposition is O(kn) (see Exercise|7.15), we derive the following theorem. 


Theorem 7.7. Let G be an n-vertex graph given together with its tree de- 
composition of width at most k. Then one can solve the DOMINATING SET 
problem in G in time 4° -kO - n. 


In Chapter we show how one can use more clever techniques in the 
computations for the join node in order to reduce the exponential dependence 
on the treewidth from 4% to 3*. 
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7.3.8 STEINER TREE 


Our last example is the STEINER TREE problem. We are given an undi- 
rected graph G and a set of vertices K C V(G), called terminals. The goal 
is to find a subtree H of G of the minimum possible size (that is, with the 
minimum possible number of edges) that connects all the terminals. Again, 
assume that n = |V(G)| and that we are given a nice tree decomposition 
T = (T, {Xthevr)) of G of width at most k. We will use the same variant 
of a nice tree decomposition as in Section that is, with introduce edge 
nodes. 

While the exponential dependence on the treewidth in the algorithms we 
discussed so far is single-exponential, for STEINER TREE the situation will be 
different. In what follows, we give an algorithm with running time kO“) - n. 
While a single-exponential algorithm for STEINER TREE actually exists, it 
requires more advanced techniques that will be discussed in Chapter [11] 

In order to make the description of the algorithm simpler, we will make 
one small adjustment to the given decomposition. Let us pick an arbitrary 
terminal u* € K and let us add it to every bag of the decomposition 7. Then 
the width of T increases by at most 1, and bags at the root and at all the 
leaves are equal to {u*}. The idea behind this simple tweak is to make sure 
that every bag of T contains at least one terminal. This will be helpful in the 
definition of the state of the dynamic program. 

Let H be a Steiner tree connecting K and let t be a node of 7. The part. 
of H contained in G; is a forest F with several connected components, see 
Fig. Note that this part is never empty, because X; contains at least one 
terminal. Observe that, since H is connected and X; contains a terminal, each 
connected component of F intersects X;. Moreover, every terminal from KM 
V; should belong to some connected component of F. We try to encode all 
this information by keeping, for each subset X C X, and each partition P of 
X, the minimum size of a forest F in G; such that 


(a) KAV: CV(F), i.e., F spans all terminals from V;, 

(b) V(F)N X: = X, and 

(c) the intersections of X; with vertex sets of connected components of F 
form exactly the partition P of X. 


When we introduce a new vertex or join partial solution (at join nodes), the 
connected components of partial solutions could merge and thus we need to 
keep track of the updated partition into connected components. 

More precisely, we introduce the following function. For a bag X+, a set 
X C X, (a set of vertices touched by a Steiner tree), and a partition P = 
{P,, P2,..., Py} of X, the value c[t,X,P] is the minimum possible number 
of edges of a forest F in G; such that: 


e F has exactly q connected components that can be ordered as C},...,Cq 
so that P, = V(Cs) N X: for each s € {1,...,q}. Thus the partition P 
corresponds to connected components of F. 
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H 
Xt 


Fig. 7.3: Steiner tree H intersecting bag X; and graph G 


e X,0V(F) =X. That is, vertices of X, \ X are untouched by F. 
e Every terminal vertex from K N V; is in V(F). 


A forest F conforming to this definition will be called compatible for (t, X, P). 
If no compatible forest F exists, we put c[t, X, P] = +00. 

Note that the size of an optimum Steiner tree is exactly c[r, {u*}, {{u*}}], 
where r is the root of the decomposition 7. This is because we have that 
Xr = {u*}. We now provide recursive formulas to compute the values of c. 


Leaf node. If t is a leaf node, then X; = {u*}. Since u* € K, we have 
c[t, 0,0] = +00 and cft, {u*}, {{u*}}] = 0. 


Introduce vertex node. Suppose that ¢ is an introduce vertex node with 
a child ¢’ such that X; = Xv U {v} for some v ¢ Xv. Recall that we have 
not introduced any edges adjacent to v so far, so v is isolated in G;. Hence, 
for every set X C X, and partition P = {P}, P2,...,P,} of X we do the 
following. If v is a terminal, then it has to be in X. Moreover, if v is in X, 
then {v} should be a block of P, that is, v should be in its own connected 
component. If any of these conditions is not satisfied, we put cft, X, P] = +oo. 
Otherwise we have the following recursive formula: 


[!,X\{u},P\{{o}}] ifue x, 


c 
t, X, P] = : 
Al T X, P] otherwise. 


Introduce edge node. Suppose that t is an introduce edge node that in- 
troduces an edge uv, and let t’ be the child of t. For every set X C X; and 
partition P = {P}, P2,...,P,} of X we consider three cases. If u ¢ X or 
v ¢ X, then we cannot include uv into the tree under construction, as one 
of its endpoints has been already determined not to be touched by the tree. 
Hence in this case cft, X, P] = ct’, X, P|]. The same happens if u and v are 
both in X, but are not in the same block of P. Assume then that u and v are 
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Fig. 7.4: Undesirable merge of partial solutions in a join node 


both in X and they actually are in the same block of P. Then the edge uv 
either has been picked to the solution, or has not been picked. If not, then 
we should look at the same partition P at t’, and otherwise the block of u 
and v in P should have been obtained from merging two smaller blocks, one 
containing u and the second containing v. Hence 


clt, X, P] = min { min cit, X, P'] + 1, elt’, X, P] k 


where in the inner minimum we consider all partitions P’ of X in which u 
and v are in separate blocks (as otherwise adding uv would create a cycle) 
such that after merging the blocks of u and v we obtain the partition P. 


Forget node. Suppose that t is a forget node with a child t’ such that 
X, = Xv \ {w} for some w € Xv. Consider any set X C X; and partition 
P = {P}, Po,...,P,} of X. The solution for X and P might either use the 
vertex w, in which case it should be added to one of existing blocks of P, or 
not use it, in which case we should simply look on the same partition of the 
same X. Hence 


clt, X, P] = min { min cft’, X U {w},P'] elt’, X, PI}, 


where the inner minimum is taken over all partitions P’ of X U {w} that are 
obtained from P by adding w to one of the existing blocks. 


Join node. Suppose t is a join node with children tı and tə. Recall that 
then X; = Xa = Xa. In essence, in the computation for t we need to 
encode merging two partial solutions: one originating from G+, and the second 
originating from G;,. When merging two partial solutions, however, we have 
to be careful because such a merge can create cycles, see Fig. |7.4 
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To avoid cycles while merging, we introduce an auxiliary structure[!|For a 
partition P of X let Gp be a forest with a vertex set X, such that the set of 
connected components in Gp corresponds exactly to P. In other words, for 
each block of P there is a tree in Gp with the same vertex set. We say that a 
partition P = {P}, Po,...,P,} of X is an acyclic merge of partitions Pı and 
P2 if the merge of two forests Gp, and Gp, (treated as a multigraph) is a 
forest whose family of connected components is exactly P. 

Thus we have the following formula: 


clt, X, P] = pon elt, X, Pı] + elte, X, Pa], 


where in the minimum we consider all pairs of partitions P1, Pa such that P 
is an acyclic merge of them. 


This concludes the description of the recursive formulas for the values 
of c. We proceed to estimate the running time. Recall that every bag of 
the decomposition has size at most k + 2. Hence, the number of states per 
node is at most 2*+? . (k + 2)*+? = kO(*), since for a node t there are 2!*«| 
subsets X C X, and at most |X|!*! partitions of X. The computation of a 
value for every state requires considering at most all the pairs of states for 
some other nodes, which means that each value can be computed in time 
(kO(%))? = kO). Thus, up to a factor polynomial in k, which is anyhow 
dominated by the O-notation in the exponent, for every node the running 
time of computing the values of c is kO‘*). We conclude with the following 
theorem. 


Theorem 7.8. Let G be an n-vertez graph, let K C V(G) be a given set of 
terminals, and assume that G is given together with its tree decomposition of 
width at most k. Then one can find the minimum possible number of edges 
of a Steiner tree connecting K in time kO) . n 


Algorithms similar to the dynamic programming of Theorem [7.8] can be 
used to solve many problems in time kO“™) . nO), Essentially, such a running 
time appears for problems with connectivity requirements, since then it is 
natural to keep in the dynamic programming state a partition of a subset 
of the bag. Since the number of such partitions is at most kO(*), this factor 
appears naturally in the running time. 

For convenience, we now state formally the most prominent problems that 
can be solved in single-exponential time when parameterized by treewidth 
(i.e., in time 20™) .n°™), and those that can be solved in slightly super- 
exponential time (i.e., KO“) -n°™). We leave designing the remaining al- 
gorithms from the following theorems as two exercises: Exercise and 
Exercise [7.19 





1 One could avoid the cycle detection by relaxing the definition of c[t, X, P] and dropping 
the assumption that F is a forest. However, as in other problems, like FEEDBACK VERTEX 
SET, cycle checking is essential, we show a common solution here. 
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Let us recall that the MAxCuT problem asks for a partition of V(G) into 
sets A and B such that the number of edges between A and B is maximized. 
The g-COLORING problem asks whether G can be properly colored using q 
colors, while in CHROMATIC NUMBER the question is to find the minimum 
possible number of colors needed to properly color G. LONGEST PATH and 
LONGEST CYCLE ask for the existence of a path/cycle on at least ¢ vertices 
in G, for a given integer @. Similarly, in CYCLE PACKING the question is 
whether one can find £ vertex-disjoint cycles in G. Problems CONNECTED 
VERTEX COVER, CONNECTED DOMINATING SET, CONNECTED FEEDBACK 
VERTEX SET differ from their standard (non-connected) variants by addition- 
ally requiring that the solution vertex cover /dominating set /feedback vertex 
set induces a connected graph in G. 


Theorem 7.9. Let G be an n-vertex graph given together with its tree de- 
composition of width at most k. Then in G one can solve 


VERTEX COVER and INDEPENDENT SET in time 2% -kO™ - n, 
DOMINATING SET in time 4® -kOM - n, 

ODD CYCLE TRANSVERSAL in time 3° -kO™ . n, 

MAXxCutT in time 2% -kOM . n, 

q-COLORING in time q" -kO - n. 


Theorem 7.10. Let G be an n-vertex graph given together with its tree de- 
composition of width at most k. Then one can solve each of the following 
problems in G in time kO) - n: 


STEINER TREE, 

FEEDBACK VERTEX SET, 

HAMILTONIAN PATH and LONGEST PATH, 
HAMILTONIAN CYCLE and LONGEST CYCLE, 
CHROMATIC NUMBER, 

CYCLE PACKING, 

CONNECTED VERTEX COVER, 

CONNECTED DOMINATING SET, 
CONNECTED FEEDBACK VERTEX SET. 





Let us also note that the dependence on k for many of the problems listed 
in Theorem [7.10] will be improved to single-exponential in Chapter [11] Also, 
the running time of the algorithm for DOMINATING SET will be improved 
from 4* - kO® . n to 3%. kOM n. 


As the reader probably observed, dynamic-programming algorithms on 
graphs of bounded treewidth are very similar to each other. 
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The main challenge for most of the problems is to understand what 
information to store at nodes of the tree decomposition. Obtaining for- 
mulas for forget, introduce and join nodes can be a tedious task, but is 
usually straightforward once a precise definition of a state is established. 


It is also worth giving an example of a problem which cannot be solved 
efficiently on graphs of small treewidth. In the STEINER FOREST problem we 
are given a graph G and a set of pairs (s1,t1),..., (Sp, tp»). The task is to find 
a minimum subgraph F of G such that each of the pairs is connected in F. 
STEINER TREE is a special case of STEINER FOREST with sı = --- = sp- 
The intuition behind why a standard dynamic programming approach on 
graphs of constant treewidth does not work for this problem is as follows. 
Suppose that we have a set of vertices S = {s;}1<i<p separated from vertices 
T = {ti}i<i<p by a bag of tree decomposition of size 2. Since all paths from 
S to T must pass through the bag of size 2, the final solution F contains 
at most two connected components. Any partial solution divides the vertices 
of S into two parts based on which vertex in the separating bag they are 
connected to. Thus it seems that to keep track of all partial solutions for the 
problem, we have to compute all possible ways the set S$ can be partitioned 
into two subsets corresponding to connected components of F, which is 2?. 
Since p does not depend on the treewidth of G, we are in trouble. In fact, 
this intuition is supported by the fact that STEINER FOREST is NP-hard on 
graphs of treewidth at most 3, see [26]. In Section [13.6] we also give examples 
of problems which are W[1|-hard parameterized by the treewidth of the input 
graph. 


Finding faster dynamic-programming strategies can be an interesting chal- 
lenge and we will discuss several nontrivial examples of such algorithms in 
Chapter However, if one is not particularly interested in obtaining the 
best possible running time, then there exist meta-techniques for designing 
dynamic-programming algorithms on tree decompositions. These tools we 
discuss in the next section. 


7.4 Treewidth and monadic second-order logic 


As we have seen in the previous sections, many optimization problems 
are fixed-parameter tractable when parameterized by the treewidth. Algo- 
rithms for this parameterization are in the vast majority derived using the 
paradigm of dynamic programming. One needs to understand how to suc- 
cinctly represent necessary information about a subtree of the decomposition 
in a dynamic-programming table. If the size of this table turns out to be 
bounded by a function of the treewidth only, possibly with some polynomial 
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factors in the total graph size, then there is hope that a bottom-up dynamic- 
programming procedure can compute the complete answer to the problem. In 
other words, the final outcome can be obtained by consecutively assembling 
information about the behavior of the problem in larger and larger subtrees 
of the decomposition. 

The standard approach to formalizing this concept is via tree automata. 
This leads to a deep theory linking logic on graphs, tree automata, and 
treewidth. In this section, we touch only the surface of this subject by high- 
lighting the most important results, namely Courcelle’s theorem and its op- 
timization variant. 

Intuitively, Courcelle’s theorem provides a unified description of properties 
of a problem that make it amenable to dynamic programming over a tree de- 
composition. This description comes via a form of a logical formalism called 
Monadic Second-Order logic on graphs. Slightly more precisely, the theorem 
and its variants state that problems expressible in this formalism are always 
fixed-parameter tractable when parameterized by treewidth. Before we pro- 
ceed to stating Courcelle’s theorem formally, we need to understand first how 
Monadic Second-Order logic on graphs works. 


7.4.1 Monadic second-order logic on graphs 


MSO, for dummies. The logic we are about to introduce is called MSO». 
Instead of providing immediately the formal description of this logic, we first 
give an example of an MSO» formula in order to work out the main concepts. 
Consider the following formula conn(X), which verifies that a subset X of 
vertices of a graph G = (V, E) induces a connected subgraph. 








conn(X) = Vycv[(duex uE Y AdAvexv ¢ Y) 
=> (deer dJuex wex inc(u,e) Aine(v,e)AUEYAVEY)). 

















Now, we rewrite this formula in English. 


For every subset of vertices Y, if X contains both a vertex from Y and a 
vertex outside of Y, then there exists an edge e whose endpoints u,v both 
belong to X, but one of them is in Y and the other is outside of Y. 


One can easily see that this condition is equivalent to the connectivity of 
G[X]: the vertex set of G cannot be partitioned into Y and V(G)\ Y in such 
a manner that X is partitioned nontrivially and no edge of G[X] crosses the 
partition. 

As we see on this example, MSO» is a formal language of expressing 
properties of graphs and objects inside these graphs, such as vertices, edges, 
or subsets of them. A formula y of MSO, is nothing else but a string over 
some mysterious symbols, which we shall decode in the next few paragraphs. 
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One may think that a formula defines a program that can be run on an input 
graph, similarly as, say, a C++ program can be run on some text input P] A 
C++ program is just a sequence of instructions following some syntax, and an 
MSO, formula is just a sequence of symbols constructed using a specified 
set of rules. A C++ program can be run on multiple different inputs, and may 
provide different results of the computation. Similarly, an MSO» formula 
may be evaluated in different graphs, and it can give different outcomes. 
More precisely, an MSO,» formula can be true in a graph, or false. The result 
of an application of a formula to a graph will be called the evaluation of the 
formula in the graph. 

Similarly to C++ programs, MSO» formulas have variables which represent 
different objects in the graph. Generally, we shall have four types of variables: 
variables for single vertices, for single edges, for subsets of vertices, and for 
subsets of edges; the last type was not used in formula conn(X). At each 
point of the process of evaluation of the formula, every variable is evaluated 
to some object of appropriate type. 

Note that a formula can have “parameters”: variables that are given from 
“outside”, whose properties we verify in the graph. In the conn(X) example 
such a parameter is X, the vertex subset whose connectivity is being tested. 
Such variables will be called free variables of the formula. Note that in order to 
properly evaluate the formula in a graph, we need to be given the evaluation of 
these variables. Most often, we will assume that the input graph is equipped 
with evaluation of all the free variables of the considered MSO» formula, 
which means that these evaluations are provided together with the graph. 

If we already have some variables in the formula, we can test their mutual 
interaction. As we have seen in the conn(X) example, we can for instance 
check whether some vertex u belongs to some vertex subset Y (u € Y), or 
whether an edge e is incident to a vertex u (inc(u,e)). These checks can 
be combined using standard Boolean operators such as ~ (negation, logical 
NOT), A (conjunction, logical AND), V (disjunction, logical OR), = (impli- 
cation). 

The crucial concept that makes MSO» useful for expressing graph prop- 
erties are quantifiers. They can be seen as counterparts of loops in standard 
programming languages. We have two types of quantifiers, V and 4. Each 
quantifier is applied to some subformula Y, which in the programming lan- 
guage analogy is just a block of code bound by the loop. Moreover, every 
quantifier introduces a new variable over which it iterates. This variable can 
be then used in the subformula. 

Quantifier V is called the universal quantifier. Suppose we write a formula 
Vuev Y, where w is some subformula that uses variable v. This formula should 
be then read as “For every vertex v in the graph, w holds.” In other words, 
quantifier V,ey iterates through all possible evaluations of variable v to a 
vertex of the graph, and for each of them it is checked whether ~ is indeed 








2 For a reader familiar with the paradigm of functional programming (languages like Lisp 
or Haskell), an analogy with any functional language would be more appropriate. 
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true. If this is the case for every evaluation of v, then the whole formula 
Vuev w is true; otherwise it is false. 

Quantifier J, called the existential quantifier, works sort of similarly. For- 
mula dyev Y should be read as “There exists a vertex v in the graph, such 
that w holds.” This means that J„ev iterates through all possible evaluations 
of variable v to a vertex of the graph, and verifies whether there is at least 
one for which w is true. 

Of course, here we just showed examples of quantification over variables 
for single vertices, but we can also quantify over variables for single edges 
(e.g., Veew/deer), vertex subsets (e.g., Vxcv/dxcv), or edge subsets (e.g., 
Vocr/dccez). Standard Boolean operators can be also used to combine larger 
formulas; see for instance our use of the implication in formula conn(X). 

We hope that the reader already understands the basic idea of MSO, as a 
(programming) language for expressing graph properties. We now proceed to 
explaining formally the syntax of MSO» (how formulas can be constructed), 
and the semantics (how formulas are evaluated). Fortunately, they are much 
simpler than for C++. 




















Syntax and semantics of MSO». Formulas of MSO,» can use four types 
of variables: for single vertices, single edges, subsets of vertices, and subsets 
of edges. The subscript 2 in MSO» exactly signifies that quantification over 
edge subsets is also allowed. If we forbid this type of quantification, we arrive 
at a weaker logic MSO}. The vertex/edge subset variables are called monadic 
variables. 

Every formula y of MSO» can have free variables, which often will be writ- 
ten in parentheses besides the formula. More precisely, whenever we write a 
formula of MSOz2, we should always keep in mind what variables are as- 
sumed to be existent in the context in which this formula will be used. The 
sequence of these variables is called the signature over which the formula is 
written}? following our programming language analogy, this is the environ- 
ment in which the formula is being defined. Then variables from the signature 
can be used in g as free variables. The signature will be denoted by X. Note 
that for every variable in the signature we need to know what is its type. 

In order to evaluate a formula y over signature X in a graph G, we need 
to know how the variables of X are evaluated in G. By XC we will denote the 
sequence of evaluations of variables from X. Evaluation of a single variable x 
will be denoted by 7°. Graph G and 7° together shall be called the structure 
in which ¢ is being evaluated. If ọ is true in structure (G, XC), then we shall 
denote it by 

(G, 5°) E y, 





3 A reader familiar with the foundations of logic in computer science will certainly see 
that we are slightly tweaking some definitions so that they fit to our small example. We 
do it in order to simplify the description of MSO2 while maintaining basic compliance 
with the general literature. In the bibliographic notes we provide pointers to more rigorous 
introductions to logic in computer science, where the notions of signature and structure 
are introduced properly. 
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which should be read as “Structure (G, Y@) is a model for ¢.” 
Formulas of MSO, are constructed inductively from smaller subformulas. 
We first describe the smallest building blocks, called atomic formulas. 


e If wu € Xis a vertex (edge) variable and X € X is a vertex (edge) set 
variable, then we can write formula u € X. The semantics is standard: the 
formula is true if and only if uf € X°. 

e If u € Xis a vertex variable and e € X is an edge variable, then we can 
write formula inc(u,e). The semantics is that the formula is true if and 
only if u@ is an endpoint of e®. 

e For any two variables x,y € X of the same type, we can write formula 
x = y. This formula is true in the structure if and only if zë = y%. 


Now that we know the basic building blocks, we can start to create larger 
formulas. As described before, we can use standard Boolean operators ~=, ^, 
V, => working as follows. Suppose that %1, y2 are two formulas over the same 
signature X. Then we can write the following formulas, also over X 


Formula =y, where (G, X) = ~g; if and only if (G, XC) Æ q1. 
Formula y1 A p2, where (G, XC) H g1 A g2 if and only if (G, 2°) H yı 
and (G, XS) H wo. 
e Formula %1 V p2, where (G, XC) H y1 V ọ2 if and only if (G, 2°) K yy 
or (G, XC) = oo. 
e Formula y1 > p2, where (G, XC) = y1 > g2 if and only if (G, XC) H yy 
implies that (G, XC) = go. 














Finally, we can use quantifiers. For concreteness, suppose we have a 
formula 7 over signature X’ that contains some vertex variable v. Let 
X = &” \ {v}. Then we can write the following formulas over X: 


e Formula yy = Vsev Y. Then (G, 5°) H wy if and only if for every vertex 
v € V(G), it holds that (G, XC, v) = y. 

e Formula y3 = Jvev Y. Then (G, XC) H gs if and only if there exists a 
vertex v@ € V (G) such that (G, XC, v%) 5 y. 











Similarly, we can perform quantification over variables for single edges 
(Vecer/JecE), vertex subsets (Vxcv/dxcv), and edge subsets (Vece/dcce)- 
The semantics is defined analogously. 

Observe that in formula conn(X) we used a couple of notation “hacks” 
that simplified the formula, but were formally not compliant to the syntax de- 
scribed above. We namely allow some shorthands to streamline writing formu- 
las. Firstly, we allow simple shortcuts in the quantifiers. For instance, Jye x w 
is equivalent to dycy (v € X) Aw and Vyex Y is equivalent to Yyey (v € 
X) = vw. We can also merge a number of similar quantifiers into one, e.g., 
dx,,x.cv is the same as dy,cvydx.cv- Another construct that we can use is 
the subset relation X C Y: it can be expressed as Vyev(u € X) > (v EY), 
and similarly for edge subsets. We can also express the adjacency relation be- 
tween two vertex variables: adj(u, v) = (u # v) A (Acer ine(u, e) A inc(v, e)). 
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Finally, we use x 4 y for —(a = y) and x ¢ X for a(x € X). The reader is 
encouraged to use his or her own shorthands whenever it is beneficial. 


Examples. Let us now provide two more complicated examples of graph 
properties expressible in MSOz. We have already seen how to express that a 
subset of vertices induces a connected graph. Let us now look at 3-colorability. 
To express this property, we need to quantify the existence of three vertex 
subsets X1, X2, X3 which form a partition of V, and where each of them is 
an independent set. 





3colorability = 4x,,x,,x,cv partition(X), X2, X3)A 
indp(X1) A indp(X2) A indp(X3). 


Here, partition and indp are two auxiliary subformulas. Formula partition 
has three vertex subset variables X1, X2, X3 and verifies that (X1, X2, X3) is 
a partition of the vertex set V. Formula indp verifies that a given subset of 
vertices is independent. 


partition(X),X2,X3)= Vrev |w € Xi Av € X2 Av € X3) 
Viug Xı Av E€ X2 Av £ X3) 
Viug Xi Avg Xə Av E€ X3); 


indp(X) = Vu, vex ~adj(u, v). 


Second, let us look at Hamiltonicity: we would like to write a formula that 
is true in a graph G if and only if G admits a Hamiltonian cycle. For this, 
let us quantify existentially a subset of edges C that is supposed to comprise 
the edges of the Hamiltonian cycle we look for. Then we need to verify that 
(a) C induces a connected graph, and (b) every vertex of V is adjacent to 
exactly two different edges of C. 





hamiltonicity = doc” connE(C) A Vrev deg2(v, C). 


Here, connE(C) is an auxiliary formula that checks whether the graph (V, C) 
is connected (using similar ideas as for conn(X)), and deg2(v,C) verifies 
that vertex v has exactly two adjacent edges belonging to C: 








connE(C)= Vycv [(duevu€ Y¥Adevv éY) 


= (Heec JueY JvgY inc(u, e) ^ inc(v, e))]; 




















deg2(v,C) = Je, esec [(e1  €2) A inc(v, e1) A inc(v, e2)^ 
(Vesec inc(v, e3) = (e1 = e3 V e2 = €3))]. 
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7.4.2 Courcelle’s theorem 


In the following, for a formula ¢ by |||| we denote the length of the encoding 
of ọ as a string. 


Theorem 7.11 (Courcelle’s theorem, [98]). Assume that p is a formula 
of MSO, and G is an n-vertex graph equipped with evaluation of all the free 
variables of p. Suppose, moreover, that a tree decomposition of G of width t 
is provided. Then there exists an algorithm that verifies whether ọ is satisfied 
in G in time f(||p||,t)- n, for some computable function f. 


The proof of Courcelle’s theorem is beyond the scope of this book, and 
we refer to other sources for a comprehensive presentation. As we will see 
later, the requirement that G be given together with its tree decomposition 
is not necessary, since an optimal tree decomposition of G can be computed 
within the same complexity bounds. Algorithms computing treewidth will be 
discussed in the next section and in the bibliographic notes. 

Recall that in the previous section we constructed formulas 3colorability 
and hamiltonicity that are satisfied in G if and only if G is 3-colorable 
or has a Hamiltonian cycle, respectively. If we now apply Courcelle’s theo- 
rem to these constant-size formulas, we immediately obtain as a corollary 
that testing these two properties of graphs is fixed-parameter tractable when 
parameterized by treewidth. 

Let us now focus on the VERTEX COVER problem: given a graph G and 
integer k, we would like to verify whether G admits a vertex cover of size at 
most k. The natural way of expressing this property in MSO» is to quantify 
existentially k vertex variables, representing vertices of the vertex cover, and 
then verify that every edge of G has one of the quantified vertices as an 
endpoint. However, observe that the length of such a formula depends linearly 
on k. This means that a direct application of Courcelle’s theorem gives only 
an f(k,t)-n algorithm, and not an f(t)-n algorithm as was the case for the 
dynamic-programming routine of Corollary [7.6] Note that the existence of an 
f(k,t) -n algorithm is only a very weak conclusion, because as we already 
have seen in Section even the simplest branching algorithm for VERTEX 
COVER runs in time O(2* . (n + m)). 

Therefore, we would rather have the following optimization variant of 
the theorem. Formula y has some free monadic (vertex or edge) variables 
X1,X2,...,Xp, which correspond to the sets we seek in the graph. In 
the VERTEX COVER example we would have one vertex subset variable 
X that represents the vertex cover. Formula y verifies that the variables 
X1, X2,..., Xp satisfy all the requested properties; for instance, that X in- 
deed covers every edge of the graph. Then the problem is to find an evalua- 
tion of variables X1, X2,...,X, that minimizes/maximizes the value of some 
arithmetic expression a(|X;|,|X2|,...,|Xp|) depending on the cardinalities 
of these sets, subject to y(X1, X2,..., Xp) being true. We will focus on a 
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being an affine function, that is, a(£1, £2, ..., £p) = ao + > >_, Qiz; for some 
a0, @1,..-,@) E R. 

The following theorem states that such an optimization version of Cour- 
celle’s theorem indeed holds. 


Theorem 7.12 ([19]). Let y be an MSO, formula with p free monadic vari- 
ables X1, X2,..., Xp, and let a(£1, £2,..., £p) be an affine function. Assume 
that we are given an n-vertex graph G together with its tree decomposition 
of width t, and suppose G is equipped with evaluation of all the free vari- 
ables of p apart from X1, X2,..., Xp- Then there exists an algorithm that in 
Foll, t) on finds the minimum and maximum value of a(|Xı|, | X2], -- - , |Xpl) 
for sets X,,X2,...,Xp for which p(X1, X2,..., Xp) is true, where f is some 
computable function. 


To conclude our VERTEX COVER example, we can now write a simple 
constant-length formula vcover(X) that verifies that X is a vertex cover 
of G: veover(X) = Veer Irex inc(x,e). Then we can apply Theorem 
to vcover and a(|X|) = |X|, and infer that finding the minimum cardinality 
of a vertex cover can be done in f(t)-n time, for some function f. 

Note that both in Theorem[7.11]and in Theorem [7.12]we allow the formula 
to have some additional free variables, whose evaluation is provided together 
with the graph. This feature can be very useful whenever in the considered 
problem the graph comes together with some predefined objects, e.g., ter- 
minals in the STEINER TREE problem. Observe that we can easily write an 
MSO, formula Steiner(K, F) for a vertex set variable K and edge set vari- 
able F, which is true if and only if the edges from F form a Steiner tree 
connecting K. Then we can apply Theorem [7.12|to minimize the cardinality 
of F subject to Steiner(K, F) being true, where the vertex subset K is given 
together with the input graph. Thus we can basically re-prove Theorem [7.8] 
however without any explicit bound on the running time. 

To conclude, let us deliberate briefly on the function f in the bound on 
the running time of algorithms provided by Theorems [7.11] and Un- 
fortunately, it can be proved that this function has to be nonelementary; in 
simple words, it cannot by bounded by a folded c times exponential func- 
tion for any constant c. Generally, the main reason why the running time 
must be so high is the possibility of having alternating sequences of quanti- 
fiers in the formula y. Slightly more precisely, we can define the quantifier 
alternation of a formula y to be the maximum length of an alternating se- 
quence of nested quantifiers in y, i.e., VIVA... (we omit some technicalities 
in this definition). Then it can be argued that formulas of quantifier alter- 
nation at most q give rise to algorithms with at most c-times exponential 
function f, where c depends linearly on q. However, tracing the exact bound 
on f even for simple formulas ọ is generally very hard, and depends on the 
actual proof of the theorem that is used. This exact bound is also likely to 
be much higher than optimal. For this reason, Courcelle’s theorem and its 
variants should be regarded primarily as classification tools, whereas design- 
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ing efficient dynamic-programming routines on tree decompositions requires 
“getting your hands dirty” and constructing the algorithm explicitly. 


7.5 Graph searching, interval and chordal graphs 


In this section we discuss alternative interpretations of treewidth and path- 
width, connected to the concept of graph searching, and to the classes of 
interval and chordal graphs. We also provide some tools for certifying that 
treewidth and pathwidth of a graph is at least /at most some value. While 
these results are not directly related to the algorithmic topics discussed in 
the book, they provide combinatorial intuition about the considered graph 
parameters that can be helpful when working with them. 


Alternative characterizations of pathwidth. We start with the concept 
of graph searching. Suppose that G is a graph representing a network of tun- 
nels where an agile and omniscient fugitive with unbounded speed is hiding. 
The network is being searched by a team of searchers, whose goal is to find 
the fugitive. A move of the search team can be one of the following: 


e Placement: We can take a searcher from the pool of free searchers and 
place her on some vertex v. 

e Removal: We can remove a searcher from a vertex v and put her back to 
the pool of free searchers. 


Initially all the searchers are free. A search program is a sequence of moves 
of searchers. While the searchers perform their program, the fugitive, who 
is not visible to them, can move between the nodes of the network at un- 
bounded speed. She cannot, however, pass through the vertices occupied by 
the searchers, and is caught if a searcher is placed on a vertex she currently oc- 
cupies. Searchers win if they launch a search program that guarantees catch- 
ing the fugitive. The fugitive wins if she can escape the searchers indefinitely. 
Another interpretation of the same game is that searchers are cleaning a 
network of tunnels contaminated with a poisonous gas. Initially all the edges 
(tunnels) are contaminated, and an edge becomes clear if both its endpoints 
are occupied by searchers. The gas, however, spreads immediately between 
edges through vertices that are not occupied by any searcher. If a cleaned edge 
becomes contaminated in this manner, we say that it is recontaminated. The 
goal of the searchers is to clean the whole network using a search program. 
These interpretations can be easily seen to be equivalent f] and they lead 
to the following definition of a graph parameter. The node search number of 
a graph G is the minimum number of searchers required to clean G from gas 





4 Strictly speaking, there is a boring corner case of nonempty edgeless graph, where no gas 
is present but one needs a searcher to visit one by one all vertices to catch the fugitive. 
In what follows we ignore this corner case, as all further results consider only a positive 
number of searchers. 
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or, equivalently, to catch an invisible fugitive in G. For concreteness, from 
now on we will work with the gas cleaning interpretation. 

An important question in graph searching is the following: if k searchers 
can clean a graph, can they do it in a monotone way, i.e., in such a way that 
no recontamination occurs? The answer is given by the following theorem of 
LaPaugh; its proof lies beyond the scope of this book. 


Theorem 7.13 ([315]). For any graph G, if k searchers can clear G, then 
k searchers can clear G in such a manner that no edge gets recontaminated. 


Let us now proceed to the class of interval graphs. A graph G is an interval 
graph if and only if one can associate with each vertex v € V(G) a closed 
interval I, = [ly,Tv], ly < ru, on the real line, such that for all u,v € V(G), 
u Æ v, we have that uv € E(G) if and only if I, I, 4 0. The family 
of intervals Z = {I,}yey is called an interval representation or an interval 
model of G. By applying simple transformations to the interval representation 
at hand, it is easy to see that every interval graph on n vertices has an interval 
representation in which the left endpoints are distinct integers 1,2,...,n. We 
will call such a representation canonical. 

Recall that a graph G” is a supergraph of a graph G if V(G) C V(G’) and 
E(G) C E(G’). We define the interval width of a graph G as the minimum 
over all interval supergraphs G” of G of the maximum clique size in G”. That 
is 


interval-width(G) = min {w(G’) : G C G’ AG’ is an interval graph} . 


Here, w(G’) denotes the maximum size of a clique in G”. In other words, the 
interval width of G is at most k if and only if there is an interval supergraph 
of G with the maximum clique size at most k. Note that in this definition we 
can assume that V(G’) = V(G), since the class of interval graphs is closed 
under taking induced subgraphs. 

We are ready to state equivalence of all the introduced interpretations of 
pathwidth. 


Theorem 7.14. For any graph G and any k > 0, the following conditions 
are equivalent: 


(i) The node search number of G is at most k +1. 
(it) The interval width of G is at most k +1. 
(iii) The pathwidth of G is at most k. 


Proof. (i) = (ii). Without loss of generality, we can remove from G all iso- 
lated vertices. Assume that there exists a search program that cleans G using 
at most k + 1 searchers. By Theorem [7.13] we can assume that this search is 
monotone, i.e., no edge becomes recontaminated. Suppose the program uses 
p moves; we will naturally index them with 1,2,...,p. Let 71, £2,..., £p be 
the sequence of vertices on which searchers are placed /from which searchers 
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are removed in consecutive moves. Since G has no isolated vertices, each ver- 
tex of G has to be occupied at some point (to clear the edges incident to 
it) and hence it appears in the sequence x), %2, ..., Zp. Also, without loss of 
generality we assume that each placed searcher is eventually removed, since 
at the end we can always remove all the searchers. 

We now claim that we can assume that for every vertex v € V(G), a 
searcher is placed on v exactly once and removed from v exactly once. Let us 
look at the first move 7 when an edge incident to v is cleaned. Note that it 
does not necessarily hold that x; = v, but vertex v has to be occupied after 
the j-th move is performed. Let jı < j be the latest move when a searcher is 
placed at v before or at move j; in particular xj, = v. Let j2 > j be the first 
move when a searcher is removed from v after move j; in particular xj, = v. 
Since j is the first move when an edge incident to v is cleaned, we have 
that before jı all the edges incident to v are contaminated. Therefore, we 
can remove from the search program any moves at vertex v that are before 
ji, Since they actually do not clean anything. On the other hand, observe 
that at move jə all the edges incident to v must be cleaned, since otherwise a 
recontamination would occur. These edges stay clean to the end of the search, 
by the monotonicity of our search program. Therefore, we can remove from 
the search program all the moves at v that occur after j2, since they actually 
do not clean anything. 

For every vertex v, we define ¢(v) as the first move when a searcher is 
placed on v, and r(v) as the move when a searcher is removed from v. Now 
with each vertex v we associate an interval I, = [€(v),r(v) — 1]; note that 
integers contained in I, are exactly moves after which v is occupied. The 
intersection graph Gz of intervals Z = {I,}vev is, of course, an interval 
graph. This graph is also a supergraph of G for the following reason. Since 
every edge uv is cleared, there is always a move jy, after which both u and 
v are occupied by searchers. Then juv € Iu O Iu, which means that [,, and I, 
have nonempty intersection. Finally, the maximum size of a clique in Gy is 
the maximum number of intervals intersecting in one point, which is at most 
the number of searchers used in the search. 

(it) = (iii). Let Gr be an interval supergraph of G with the maximum 
clique size at most k+1, and let Z = {I,},cy be the canonical representation 
of Gy. For i € {1,...,n}, we define X; as the set of vertices v of G whose 
corresponding intervals I, contain i. All intervals associated with vertices of 
X; pairwise intersect at i, which means that X; is a clique in G. Consequently 
|X;| < & +1. We claim that (X1,..., Xn) is a path decomposition of G. 
Indeed, property (P1) holds trivially. For property (P2), because Gpr is a 
supergraph of G, for every edge uv € E(G) we have I„ N I, 4 @. Since 
left endpoints of intervals of Z are 1,2,...,n, there exists also some i € 
{1,2,...,n} that belongs to I„ N I,. Consequently X; contains both u and 
v. For property (P3), from the definition of sets X; it follows that, for every 
vertex v, the sets containing v form an interval in {1,2,...,n}. 
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(iii) > (i). Let P = (X1, X2,..., X7) be a path decomposition of G of 
width at most k. We define the following search program for k + 1 searchers. 
First, we place searchers on X,. Then, iteratively for i = 2,3,...,1r, we move 
searchers from X;—ı to X; by first removing all the searchers from X;_1 \ 
X;, and then placing searchers on X; \ X;-1. Note that in this manner the 
searchers are all the time placed on the vertices of X;—1 OX;. Moreover, since 
the sizes of bags X; are upper bounded by k+1, we use at most k+1 searchers 
at a time. We now prove that this search program cleans the graph. 

By property (P2), for every edge uv € E(G) there exists a bag X; such 
that {u,v} C X;. Hence, every edge gets cleaned at some point. It remains 
to argue that no recontamination happens during implementation of this 
search program. For the sake of contradiction, suppose that recontamination 
happens for the first time when a searcher is removed from some vertex 
u € Xj_-1 \ X;, for some i > 2. This means that u has to be incident to some 
edge uv that is contaminated at this point. Again by the property (P2), 
there exists some bag X; such that {u,v} € X,. However, since u € Xj-1 
and u ¢ X;, then by property (P3) u can only appear in bags with indices 
1,2,...,2-—1. Hence j < i—1, which means that edge uv has been actually 
already cleaned before, when searchers were placed on the whole bag X;. 
Since we assumed that we consider the first time when a recontamination 
happens, we infer that edge uv must have remained clean up to this point, a 
contradiction. 














Alternative characterizations of treewidth. A similar set of equivalent 
characterizations is known for the treewidth. The role of interval graphs is 
now played by chordal graphs. Let us recall that a graph is chordal if it does 
not contain an induced cycle of length more than 3, i.e., every cycle of length 
more than 3 has a chord. Sometimes chordal graphs are called triangulated. 
It is easy to prove that interval graphs are chordal, see Exercise [7.27] 

We define the chordal width of a graph G as the minimum over all chordal 
supergraphs G” of G of the maximum clique size in G”. That is, 


chordal-width(G) = min {w(G’) : G C G’ AG’ is a chordal graph} . 





As we will see soon, the chordal width is equal to treewidth (up to a +1 sum- 
mand), but first we need to introduce the notion of graph searching related 
to the treewidth. 

The rules of the search game for treewidth are very similar to the node 
searching game for pathwidth. Again, a team of searchers, called in this set- 
ting cops, tries to capture in a graph an agile and omniscient fugitive, called 
in this setting a robber. The main difference is that the cops actually know 
where the robber is localized. At the beginning of the game cops place them- 
selves at some vertices, and then the robber chooses a vertex to start her 
escape. The game is played in rounds. Let us imagine that cops are equipped 
with helicopters. At the beginning of each round, some subset of cops take off 
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in their helicopters, declaring where they will land at the end of the round. 
While the cops are in the air, the robber may run freely in the graph; how- 
ever, she cannot pass through vertices that are occupied by cops that are not 
airborne. After the robber runs to her new location, being always a vertex, 
the airborne cops land on pre-declared vertices. As before, the cops win if 
they have a procedure to capture the robber by landing on her location, and 
the robber wins if she can avoid cops indefinitely. 

Intuitively, the main difference is that if cops are placed at some set S C 
V(G), then they know in which connected component of G — S the robber 
resides. Hence, they may concentrate the chase in this component. In the 
node search game the searchers lack this knowledge, and hence the search 
program must be oblivious to the location of the fugitive. It is easy to see 
that when cops do not see the robber, then this reduces to an equivalent 
variant of the node search game. However, if they see the robber, they can 
gain a lot. For example, two cops can catch a visible robber on any tree, but 
catching an invisible fugitive on an n-vertex tree can require log; n searchers; 
see Exercise [7.10 

Finally, let us introduce the third alternative characterization of treewidth, 
which is useful in showing lower bounds on this graph parameter. We say that 
two subsets A and B of V(G) touch if either they have a vertex in common, 
or there is an edge with one endpoint in A and second in B. A bramble is the 
family of pairwise touching connected vertex sets in G. A subset C C V(G) 
covers a bramble $ if it intersects every element of B. The least number of 
vertices covering bramble B is the order of B. 

It is not difficult to see that if G has a bramble 6 of order k + 1, then k 
cops cannot catch the robber. Indeed, during the chase the robber maintains 
the invariant that there is always some X € B that is free of cops, and in 
which she resides. This can be clearly maintained at the beginning of the 
chase, since for every set of initial position of cops there is an element X of 
B that will not contain any cops, which the robber can choose for the start. 
During every round, the robber examines the positions of all the cops after 
landing. Again, there must be some Y € B that will be disjoint from the set 
of these positions. If X = Y then the robber does not need to move, and 
otherwise she runs through X (which is connected and free from cops at this 
point) to a vertex shared with Y or to an edge connecting X and Y. In this 
manner the robber can get through to the set Y, which is free from the cops 
both at this point and after landing. 

This reasoning shows that the maximum order of a bramble in a graph G 
provides a lower bound on the number of cops needed in the search game on 
G, and thus on the treewidth of G. The following deep result of Seymour and 
Thomas shows that this lower bound is in fact tight. We state this theorem 
without a proof. 


Theorem 7.15 ( ). For every k > 0 and graph G, the treewidth of G is 
at least k if and only if G contains a bramble of order at least k + 1. 
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Now we have gathered three alternative characterizations of treewidth, 
and we can state the analogue of Theorem [7.14] for this graph parameter. 
The result is given without a proof because of its technicality, but the main 
steps are discussed in Exercise [7.28] 


Theorem 7.16. For any graph G and any k > 0, the following conditions 
are equivalent: 


(i) The treewidth of G is at most k. 

(it) The chordal width of G is at most k +1. 

(iii) k+1 cops can catch a visible robber on G. 

(iv) There is no bramble of order larger than k + 1 in G. 


7.6 Computing treewidth 


In all the applications of the treewidth we discussed in this chapter, we were 
assuming that a tree decomposition of small width is given as part of the 
input. A natural question is how fast we can find such a decomposition. 
Unfortunately, it turns out that it is NP-hard to compute the treewidth of a 
given graph, so we need to resort to FPT or approximation algorithms. 

In order to make our considerations more precise, let us consider the 
TREEWIDTH problem defined as follows: we are given a graph G and an 
integer k, and the task is to determine whether the treewidth of G is at 
most k. There is a “simple” argument why TREEWIDTH is fixed-parameter 
tractable: It follows from the results of the Graph Minors theory of Robert- 
son and Seymour, briefly surveyed in Chapter [6] More precisely, it is easy 
to see that treewidth is a minor-monotone parameter in the following sense: 
for every graph G and its minor H, it holds that tw(H) < tw(G) (see Ex- 
ercise [7.7). Hence if we define G, to be the class of graphs of treewidth at 
most k, then Gx is closed under taking minors. By Corollary [6.11] there ex- 
ists a set of forbidden minors Forb(G;,), which depends on k only, such that 
a graph has treewidth k if and only if it does not contain any graph from 
Forb(G;) as a minor. Hence, we can apply the algorithm of Theorem [6.12] to 
each H € Forb(G;) separately, verifying in f(H)-|V(G)|> time whether it is 
contained in G as a minor. Since Forb(G;,,) depends only on k, both in terms 
of its cardinality and the maximum size of a member, this algorithm runs in 
g(k) -|V(G)|° for some function g. Similarly to the algorithms discussed in 
Section this gives only a nonuniform FPT algorithm — Corollary 
does not provide us any means of constructing the family Gk, or even bound- 
ing its size. 

Alternatively, one could approach the TREEWIDTH problem using the 
graph searching game that we discussed in Section This direction quite 
easily leads to an algorithm with running time nO); the reader is asked to 
construct it in Exercise [7.26 
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It is, nonetheless, possible to obtain a uniform, constructive algorithm for 
the TREEWIDTH problem. However, this requires much more involved tech- 
niques and technical effort. In the following theorem we state the celebrated 
algorithm of Bodlaender that shows that TREEWIDTH is FPT. 


Theorem 7.17 ([45]). There exists an algorithm that, given an n-vertex 
graph G and integer k, runs in time kO?) . n and either constructs a tree 
decomposition of G of width at most k, or concludes that tw(G) > k. 


Generally, in the literature there is a variety of algorithms for comput- 
ing good tree decompositions of graphs. The algorithm of Theorem [7.17] is 
the fastest known exact parameterized algorithm, meaning that it computes 
the exact value of the treewidth. Most other works follow the direction of 
approximating treewidth. In other words, we relax the requirement that the 
returned decomposition has width at most k by allowing the algorithm to 
output a decomposition with a slightly larger width. In exchange we would 
like to obtain better parameter dependence than ROR) which would be a 
dominating factor in most applications. 

In this section we shall present the classical approximation algorithm for 
treewidth that originates in the work of Robertson and Seymour. More pre- 
cisely, we prove the following result. 


Theorem 7.18. There exists an algorithm that, given an n-vertex graph G 
and integer k, runs in time O(8*k? -n?) and either constructs a tree decom- 
position of G of width at most 4k + 4, or concludes that tw(G) > k. 


We remark that the running time of algorithm of Theorem [7.17] depends 
linearly on the size of the graph, whereas in Theorem [7.18]the dependence is 
quadratic. Although in this book we usually do not investigate precisely the 
polynomial factors in the running time bounds, we would like to emphasize 
that this particular difference is important in applications, as the running 
time of most dynamic-programming algorithms on tree decompositions de- 
pends linearly on the graph size, and, consequently, the n? factor coming 
from Theorem is a bottleneck. To cope with this issue, very recently 
a 5-approximation algorithm running in time 2°“) . n has been developed 
(see also the bibliographic notes at the end of the chapter); however, the 
techniques needed to obtain such a result are beyond the scope of this book. 

The algorithm of Theorem [7.18]is important not only because it provides 
an efficient way of finding a reasonable tree decomposition of a graph, but 
also because the general strategy employed in this algorithm has been reused 
multiple times in approximation algorithms for other structural graph pa- 
rameters. The crux of this approach can be summarized as follows. 


e It is easy to see that every n-vertex tree T contains a vertex v such 


that every connected component of T — v has at most 3 vertices. 
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A similar fact can be proved for graphs of bounded treewidth. More 
precisely, graphs of treewidth at most k have balanced separators 
of size at most k + 1: it is possible to remove k + 1 vertices from 
the graph so that every connected component left is “significantly” 
smaller than the original graph. 

e The algorithm decomposes the graph recursively. At each step we try 
to decompose some part of a graph that has only ck vertices on its 
boundary, for some constant c. 

e The crucial step is to find a small separator of the currently processed 
part H with the following property: the separator splits H into two 
pieces Hı, Hə so that the boundary of H gets partitioned evenly 
between Hı and Hə. If for some a > 1, each of Hı and Hə contains 
only, say, “ vertices of 0(H), then we have |0(H;)|,|O(H2)| < “ + 
(k+1); the summand k+1 comes from the separator. If “ +(k+1) < 
ck, then we can recurse into pieces Hı and Ho. 


Before we proceed to the proof of Theorem [7.18] we need to introduce 
some auxiliary results about balanced separators and separations in graphs 
of small treewidth. 


7.6.1 Balanced separators and separations 


From now on we will work with the notions of separators and separations 
in graphs; we have briefly discussed these concepts in Section Let us 
recall the main points and develop further these definitions. A pair of vertex 
subsets (A, B) is a separation in graph G if AU B = V(G) and there is no 
edge between A \ B and B \ A. The separator of this separation is AN B, 
and the order of separation (A, B) is equal to |AN B]. 

We say that (A, B) separates two sets X,Y if X C Aand Y C B. Note that 
then every X-Y path, i.e., a path between a vertex from X and a vertex from 
Y, has to pass through at least one vertex of A N B. Given this observation, 
we may say that a set C C V(G) separates two vertex sets X and Y if every 
X-Y path contains a vertex of C. Note that in this definition we allow C to 
contain vertices from X or Y. Given C that separates X and Y, it is easy 
to construct a separation (A, B) that separates X and Y and has C as the 
separator. 

For two vertex sets X and Y, by u(X,Y) we denote the minimum size of 
a separator separating X and Y, or, equivalently, the minimum order of a 
separation separating X and Y. Whenever the graph we refer to is not clear 
from the context, we put it as a subscript of the u symbol. By the classic the- 
orem of Menger, u(X, Y) is equal to the maximum number of vertex-disjoint 
X-Y paths. The value of (X,Y) can be computed in polynomial time by 
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any algorithm for computing the maximum flow in a graph. Moreover, within 
the same running time we can obtain both a separation of order p(X, Y) sep- 
arating X and Y, as well as a family of (X,Y) vertex-disjoint X-Y paths 
(see also Theorems [8.4] and [8.5). 

We now move to our first auxiliary result, which states that graphs of 
treewidth at most k have balanced separators of size at most k+1. We prove 
a slightly more general statement of this fact. Assume that w: V(G) > Rso 
is a nonnegative weight function on the vertices of G. For a set X C V(G), 
let us define w(X) = „ey w(u). Let a € (0,1) be any constant. We say 
that a set X C V(G) is an a-balanced separator in G if for every connected 
component D of G — X, it holds that w(D) < a-w(V(G)). Informally 
speaking, a balanced separator breaks the graph into pieces whose weights 
constitute only a constant fraction of the original weight of the graph. 


Like trees, graphs of “small” treewidth have “small” balanced separators. 
This property is heavily exploited in numerous algorithmic applications 
of treewidth. 


If we put uniform weights on vertices, then balanced separators split the 
graph more or less evenly with respect to the cardinalities of the obtained 
pieces. Recall, however, that in the presented strategy for the algorithm of 
Theorem [7.18] we would like to split evenly some smaller subset of vertices, 
which is the boundary of the currently processed part of the graph. Therefore, 
in our applications we will put weights 1 on some vertices of the graph, and 
0 on the other. Then balanced separators split evenly the set of vertices that 
are assigned weight 1. In other words, what we need is a weighted version of 
balanced separators. 

We now prove the fact that graphs of small treewidth admit small balanced 
separators. 


Lemma 7.19. Assume G is a graph of treewidth at most k, and consider a 
nonnegative weight function w: V(G) —> Rso on the vertices of G. Then in 
G there exists a t -balanced separator X of size at most k + 1. 


Proof. Let T = (T,{Xt}tevcr)) be a tree decomposition of G of width at 
most k. We prove that one of the bags of T is a ž-balanced separator we are 
looking for. We start by rooting T at an arbitrarily chosen node r. For a node 
t of T, let V; be the set of vertices of G contained in the bags of the subtree 
T, of T rooted at node t (including t itself). Let us select a node t of T such 
that: 


e w(Vi) > 4w(V(G)), and subject to 


e tis at the maximum distance in T from r. 


Observe that such a vertex t exists because root r satisfies the first condition. 
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We claim that X; is a 4-balanced separator in G. Let t),...,tp be the 
children of t in T (possibly p = 0). Observe that every connected component 
D of V(G)\ X: is entirely contained either in V(G)\V, or in V}, \ X; for some 
i € {1,2,...,p}. Since w(V;) > $w(V(G)), then w(V (G) \ Vi) < $w(V(G)). 
On the other hand, by the choice of t we have that w(Vi,) < $w(V(G)) 
for every i € {1,...,p}. Consequently, in both cases D is entirely contained 
in a set whose weight is at most }w(V(G)), which means that w(D) < 


5w(V(G)). 














The proof of Lemma|?.19|shows that if a tree decomposition of G of width 
at most k is given, one can find such a 3-balanced separator in polynomial 
time. Moreover, the found separator is one of the bags of the given decom- 
position. However, in the remainder of this section we will only need the 
existential statement. 

Recall that in the sketched strategy we were interested in splitting the 
currently processed subgraph into two even parts. However, after deleting 
the vertices of the separator given by Lemma|7.19|the graph can have more 
than two connected components. For this reason, we would like to group these 
pieces (components) into two roughly equal halves. More formally, we say that 
a separation (A, B) of Gis an a-balanced separation if w(A\ B) < a-w(V(G)) 
and w(B \ A) < a- w(V(G)). The next lemma shows that we can focus on 
balanced separations instead of separators, at a cost of relaxing l to 2, 


Lemma 7.20. Assume G is a graph of treewidth at most k, and consider a 
nonnegative weight function w: V (G) —> R>o on the vertices of G. Then in 
G there exists a 2 -balanced separation (A, B) of order at most k + 1. 


Proof. Let X bea 3-balanced separator in G of size at most k + 1, given by 
Lemma|?.19} Let Dı, D2,..., Dp be the vertex sets of connected components 
of G — X. For i € {1,2,...,p}, let a; = w(D;); recall that we know that 
a; < 4w(V(G)). By reordering the components if necessary, assume that 
a, 2 ag 2... > Ap. 

Let q be the smallest index such that }77_, a; > }w(V(G)), or q = p if no 
such index exists. We claim that }>7_, a; < }w(V(G)). This is clearly true 
if oP ai < $w(V(G)). Also, if q = 1, then X>}; a; = a1 < $w(V(G)) and 
the claim holds in this situation as well. Therefore, assume that Yi ai > 
w(V(G)) and q > 1. By the minimality of q, we have that So ai < 
w(V(G)). Hence ag < ag1 < Ei a; < Aw(V(G)), so X2; ai = ag + 
EL] a < 4w(V(G)) + 4w(V(G)) = 2w(V(G)). This proves the claim. 


1 
3 
1 
3 


We now define A = X UUj_, D: and B = X U Ui-q+1 Di- Clearly (A, B) 


is a separation of G with X being the separator, so it has order at most k +1. 
By the claim from the last paragraph, we have that w(A \ B) = DZ] ai < 
3w(V(G)). Moreover, either we have that p = q and B\ A= 9, orp < q 
and then X; a; > 3w(V(G)). In the first case w(B \ A) = 0, and in the 


latter case we have that w(B \ A) = aer ai < w(V(G)) — $w(V(G)) = 
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2w(V(G)). We conclude that (A, B) is a $-balanced separation in G of order 


at most k + 1. 


From Lemma we can infer the following corollary, which will be our 
main tool in the proof of Theorem 


Corollary 7.21. Let G be a graph of treewidth at most k and let S C V(G) 
be a vertex subset with |S| = 3k + 4. Then there exists a partition (S4, Sp) 
of S such thatk +2 < |S4|,|SB| < 2k +2 and ua(S4, SB) <k+1. 














Proof. Let us define the following weight function w: w(u) = 1 if u € S, and 
w(u) = 0 otherwise. Let (A, B) be a separation provided by Lemma [7.20] We 
know that (A, B) has order at most k+1 and that |(A\ B)NS]|,|(B\A)NS| < 
3|S| = 2k + 8. Since these cardinalities are integral, we in fact have that 
|(A\ B)N S|, |(B\ A)N S| < 2k +2. 

We now define the partition (S4, Sg). We first put (A \ B) A S into S4 
and (B \ A) N S into Sg; in this manner we have at most 2k + 2 vertices in 
Sa and at most 2k + 2 vertices in Sg. Then we iteratively examine vertices 
of (AN B) N S, and each vertex u € (AN B)NS is assigned to S4 or to Sp, 
depending on which of them is currently smaller (or arbitrarily, if they are 
equal). Since |S| = 3k +4 < 2- (2k + 2), in this manner none of the sets S4 
or Sg can get larger than 2k + 2. 

Thus we obtain that |Sa| < 2k + 2, so also |Sp| = |S| — |Sa| > k +2. 
Symmetrically |S4| > k+2. Finally, observe that separation (A, B) separates 
Sa and Spg, which proves that ua(Sa, SB) <|ANB| <k-+1. 

















7.6.2 An FPT approximation algorithm for treewidth 


Armed with our understanding of balanced separators in graphs of bounded 
treewidth, we can proceed to the proof of Theorem 


Proof (of Theorem (7.18). We will assume that the input graph G is con- 
nected, as otherwise we can apply the algorithm to each connected compo- 
nent of G separately, and connect the obtained tree decompositions arbitrar- 
ily. Furthermore, we assume that G has at most kn edges, since otherwise we 
can immediately conclude that tw(G) > k; see Exercise[7.15] Let m = |E(G)]. 

We present a recursive procedure decompose(W, S) for SC W C V(G), 
which tries to decompose the subgraph G[W] in such a way that the set S is 
contained in one bag of the decomposition. The procedure will work under 
the assumption that the following invariants are satisfied: 


(i) |S] < 3k + 4 and W \ S #9; 
(ii) both G[W] and G[W \ S] are connected; 
(iii) S = Ng(W \ S$). In other words, every vertex of S' is adjacent to some 
vertex of W \ 5, and the vertices of W \ S do not have neighbors outside 
W. 
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These invariants exactly express the properties that we want to maintain 
during the algorithm: at each point we process some connected subgraph of 
G, which communicates with the rest of G only via a small interface S. The 
output of procedure decompose(W, S) is a rooted tree decomposition Tw,s 
of the graph G[W], which has width at most 4k + 4 and satisfies S C X,, 
where r is the root of Tw,s. The whole algorithm then boils down to calling 
decompose(V(G),@); note that the invariants in this call are satisfied since 
we assumed G to be connected. 

We now explain how procedure decompose(W, S) is implemented. The 
first step is to construct a set S with the following properties: 


(a) SC Sc W, that is, Sisa proper superset of S; 

(b) |S| < 4k +5; 

(c) every connected component of G[W \ S] is adjacent to at most 3k + 4 
vertices of 9. 


Intuitively, the set S is going to form the root bag of the constructed de- 
composition. Property (b) is needed to ensure small width, and property 
will be essential for maintaining invariant (i) when we recurse into connected 
components of G[W \ S]. So far it is not clear why we require that S contains 
at least one more vertex than S. This property will be very helpful when 
estimating the running time of the algorithm. In particular, this will allow 
us to show that the number of nodes in the constructed tree decomposition 
is at most n. 7 

Construction of S is very simple if |S] < 3k + 4. We just pick an arbitrary 
vertex u € W \ S (which exists since W \ S 4 Ø), and we put S = SU {u}. 
Then \S | < 3k+4. It is straightforward to verify that properties (a), (b), and 
are satisfied in this case. 

Assume then that |S| = 3k + 4. If tw(G) < k, then also tw(G[W]) < k, 
so by Lemma [7.21] there exists a partition (S4, Spg) of S such that k +2 < 
|S4|,|SB| < 2k + 2 and pew)(S4, SB), the minimum size of a separator 
separating S4 and Sg in G[W], is at most k + 1. The algorithm iterates 
through all the 2°*+4 partitions of S into two parts, and applies a max-flow 
algorithm to verify whether the minimum order of a separation of G[W] 
separating the sides does not exceed k + 1. If no partition satisfying the 
properties expected from (S4, Sp) has been found, then by Lemma [7.21] we 
can safely conclude that tw(G[W]) > k. Hence tw(G) > k, and the whole 
algorithm can be terminated. 

Assume then that a partition (S4, Sp) satisfying the required properties 
has been obtained. This means that there exists some separation (A, B) in 
G[W] that has order at most k + 1 and separates S4 from Sg; see Fig. |7.5 
Moreover, this separation was computed during the application of a max- 
flow algorithm when partition (54,58) was considered. We now put S = 
SU(AN B). Clearly |S] < |S|+|ANB| < 3k+4+k+1 = 4k +5, so property 
(b) is satisfied. For property (c), let D be the vertex set of a connected 
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Fig. 7.5: Situation in procedure decompose(W, S) when |S| = 3k + 4 


A 


component of G[W \ S]. Since D is connected in G[W] and disjoint from 
AN B, it follows that D C A\ B or D C B\ A. Assume the former; the proof 
for the second case is symmetric. Then the vertices of S that are adjacent to 
D can be only contained in (A \ B) A S or in AN B. However, we have that 


KA\B)AOS|+|ANAB|<|Sa|+|ANA B| < (2k +2) + (k +1) = 3k +3, 


so property indeed holds. 

For property fa), we just need to check that S contains at least one more 
vertex than S. Since |S4], |SB| > k+2 and |AN B| < k+1, sets S4 \ (ANB) 
and Sg \ (AN B) are nonempty. Let ug € S4 \ (AN B) and ug € Sg \ (ANA B) 
be arbitrarily chosen vertices. By invariants and (fiii), there is a path P in 
G[W] that starts in u4, ends in ug, and whose internal vertices are contained 
in W \ S. Indeed, u4 has a neighbor wu’, in W \ S, ug has a neighbor u’'g in 
W \ S, whereas u’, and u’, can be connected by a path inside G[W \ S] due 
to the connectivity of this graph. As (A, B) separates S4 from Spg, path P 
contains at least one vertex from A N B. This vertex can be neither ua nor 
up, since these vertices do not belong to AN B. Hence AN B contains at least 
one internal vertex of P, which belongs to W \ S; this proves property (a). 

Once the set § is constructed, procedure decompose(W,S) can be com- 
pleted easily. Let D1, Dz2,...,D, be the vertex sets of connected components 
of GIW \ S| (possibly p = 0). For each i = 1,2,...,p, we call recursively 
procedure decompose(Ne|D;], Ne(D;)). Satisfaction of invariants for these 
calls follows directly from the definition of sets D;, invariant for the call 
decompose(W, S), and properties fe) and of S. Let Ti be the tree de- 
composition obtained from the call decompose(Ng[D;i], Ne(Di)), and let r; 
be its root; recall that X,, contains N(D;). We now obtain a tree decom- 
position Tw,s of G[W] as follows: create a root r with bag X, = S, and for 
every i = 1,2,...,p attach 7; below r using edge rr;. From the construction 
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it easily follows that Tw,s is a tree decomposition of G[W]. Moreover, it has 
width at most 4k + 4, since IS] < 4k + 5 (property (b)) and every tree de- 
composition 7; has width at most 4k + 4. Finally, we have that X, = S28. 
Hence Tw,s has all the required properties and can be output as the result of 
decompose(W, S). Note that p = 0 represents the base case; in this situation 
the tree decomposition Tw,s of G[W] is a single node with the bag X, = S. 

This concludes the description of the algorithm, and we are left with 
estimating its running time. For every call of decompose(W,S') when 
|S| = 3k + 4, we iterate through 23*+4 partitions of S into two sides. For 
each partition we apply, say, the Ford-Fulkerson algorithm in the graph 
G[W] to compute the maximum vertex flow between the sides. This algo- 
rithm runs a number of iterations; each iteration takes O(n + m) time and 
either concludes that the currently found flow is maximum, or augments it 
by 1. Since we are interested only in situations when the maximum flow is of 
size at most k + 1, we may terminate the computation after k + 2 iterations 
(see also Theorem [8.2). Hence, each application of the Ford-Fulkerson al- 
gorithm takes time O(k(n + m)) = O(k?n), since m < kn. In total, this 
gives time O(23*+4k2 . n) for computing set S. All the other operations 
performed in decompose(W, S), like partitioning G[W \ S] into connected 
components or joining the decompositions obtained in the recursive calls, 
can be easily implemented in O(n + m) = O(kn) time. In total, operations 
performed in procedure decompose(W, S) (excluding subcalls) take time 
O(2E+4k2 . n) = OB"? on). 

Hence, it remains to prove an upper bound on how many times procedure 
decompose(W,S) is called. Observe that each call of decompose(W, S) 
creates exactly one new node of the output tree decomposition of G (the 
root bag X, = S), so equivalently we can bound the total number of nodes 
constructed by the algorithm. This is precisely the point when we will use 
property f) of the set $ computed in each call. Let 7 be the final tree decom- 
position of G output by the algorithm. For each call decompose(W, S), we 
have that set S, which is the bag at the root of the returned decomposition 
Tw,s, contains at least one vertex u that does not belong to S. Consequently, 
the root of Tw,s is the unique top-most node of 7 whose bag contains u. 
Since for every call decompose(W, S) we can find such a vertex u, and these 
vertices are pairwise different, we infer that the total number of constructed 
nodes does not exceed the total number of vertices in the graph, which is n. 
Therefore, procedure decompose(W, S) is called at most n times, and the 
whole algorithm runs in time O(8*k? - n?). 














The algorithm of Theorem can give a negative answer only in two 
cases: if it finds out that m > kn, or when it identifies a set S of size 3k +4 
which does not satisfy the statement of Lemma|7.21]in graph G[W], and con- 
sequently also in graph G. Distinguishing the case when m > kn was needed 
only for the purpose of achieving better running time; if we drop this case, 
then the algorithm is still correct, but works in O(8*k-nm) time. Hence, af- 
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ter this small adjustment the algorithm of Theorem 7.18] provides an obstacle 
in the graph whenever it is incapable of constructing a decomposition. This 
obstacle is a set © with |S| = 3k + 4 that does not satisfy the statement of 
Corollary [7.21]in G. 

This property of a set S is a variant of the notion of well-linkedness. The 
exact definition of a well-linked set is slightly different, and the statement 
of Corollary [7.21] is contrived to its algorithmic usage in the proof of Theo- 
rem [7.18] While in this book we use a well-linked set only in the treewidth- 
approximation algorithm as a certificate for large treewidth, this notion has 
found a lot of other important algorithmic applications. In the bibliographic 
notes we provide pointers to literature giving a broader presentation of this 
subject. 


7.7 Win/win approaches and planar problems 


We start from formalizing the idea of a win/win approach, already sketched 
in the introduction of this chapter. Let us focus on the VERTEX COVER 
problem; we would like to prove that this problem is FPT using treewidth, 
ignoring for a moment the existence of a simple 2¥ -n ® branching strategy. 

Observe first that if a graph admits a vertex cover X of size k, then it 
has treewidth at most k. Indeed, one can create a simple tree decomposition 
(even a path decomposition) of width k, where every bag contains the whole 
set X and one of the vertices outside X, and the bags are arranged into a 
path arbitrarily. Given an instance (G, k) of VERTEX COVER, let us apply the 
algorithm of Theorem|7.18]to G and parameter k; this takes time O(8*k?-n?), 
where n = |V(G)|. If the algorithm concludes that tw(G) > k, then we can 
conclude that there is no vertex cover of G of size at most k. Otherwise, we 
have a tree decomposition of G of width at most 4k + 4 at hand. Hence, we 
can apply the dynamic-programming routine of Corollary [7.6] to solve the 
problem in 24*+44°) . n time. Thus, the total running time of the algorithm 
is O(8"k? -n? + 16*kO . n). 

This simple trick works well for VERTEX COVER, FEEDBACK VERTEX 
SET, or even the more general TREEWIDTH-7) MODULATOR problem, where 
for a fixed constant 7 the task is to decide whether a given graph G can be 
turned into a graph of treewidth at most 7 by deleting at most k vertices. 
Essentially, we just need two properties: that a graph in a yes-instance al- 
ways has treewidth bounded in terms of k, and that the problem admits an 
FPT algorithm when parameterized by treewidth. While the second property 
usually holds, the first one seems very restrictive. For example, consider the 
CYCLE PACKING problem: given graph G and integer k, we need to deter- 
mine whether G contains at least k vertex-disjoint cycles. This problem is in 
some sense dual to FEEDBACK VERTEX SET. So far we do not see any link 
between the treewidth of a graph and the maximum number of cycles that 
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can be packed in it. The intuition, however, is that a graph of large treewidth 
has a very complicated, entangled structure that should allow us to find as 
many cycles as we like. 

In order to follow this direction, we need to take a closer look at combi- 
natorial structures that can be always found in graphs of large treewidth. 
Equivalently, we can examine obstacles that prevent the existence of tree de- 
compositions of small width. So far we know two types of such structures. 
The first ones are brambles that we discussed in Section [7.5] However, these 
objects are quite difficult to grasp and understand, and moreover we do not 
know efficient procedures for constructing brambles of large order. The sec- 
ond ones are well-linked sets, or in our notation, sets S that do not satisfy 
the statement of Corollary [7.21] Their obvious advantage is that we get such 
a set S directly from the approximation algorithm of Theorem [7.18]in case of 
its failure. While it is still unclear why the existence of this obstacle implies 
the possibility of packing many cycles in the graph, there is at least some 
hope that from a well-linked set one could extract a more robust obstacle 
that would be easier to use. 

This intuition is indeed true, and leads to a powerful result called the 
Excluded Grid Theorem: graphs of large treewidth contain large grids as 
minors. While the proof of this result is beyond the scope this book, we 
shall state it formally and derive some algorithmic consequences using a 
win/win approach. Of particular interest for us are the corollaries for planar 
graphs. For this class of graphs the relation between the treewidth and grid 
minors is very tight, which enables us to design many efficient approxima- 
tion schemes, kernelization procedures, and, most importantly for us, fixed- 
parameter tractable algorithms. 


7.7.1 Grid theorems 


As we have discussed in the beginning of Section treewidth is a minor- 
closed parameter and hence the class G; comprising graphs of treewidth at 
most t is closed under taking minors|>| From the Graph Minors theorem it 
follows that the property of having treewidth at most t can be characterized 
by a finite set of forbidden minors Forb(G;). That is, a graph G has treewidth 
at most t if and only if it does not contain any graph H € Forb(G;) as a minor. 
But what do graphs of Forb(G,) look like? We apparently do not know the 
answer to this question. However, we would like to get some “approximate 
characterization” for graphs having treewidth at most t that is more tractable. 

The obvious first attempt is to look at minors of complete graphs. It 
is easy to see that if a graph G contains K, as a minor, then tw(G) > 
t — 1; see Exercise [7.8] . However, it is the converse implication that would 





5 As in this section we work with parameterized problems with parameter denoted by k, 
from now on we switch to a convention of denoting the width of a tree decomposition by t. 
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be more valuable to us: it would be perfect if, for some function g, the fact 
that tw(G) > g(t) would imply the existence of a K, minor model in G. 
This implication is unfortunately not true, as we will see in the next few 
paragraphs. 

Consider a large grid. More precisely, for a positive integer t, a t x t grid 
t İs a graph with vertex set {(z,y) : x,y € {1,...,¢}}. Thus Œ; has exactly 
t? vertices. Two different vertices (x,y) and (2’,y’) are adjacent if and only 
if |z — x'| + |y — y'| = 1. See the left graph in Fig.|7.6 

It is easy to show that the treewidth of H; is at most t. For example, t+ 1 
cops can always catch the fugitive by sweeping the grid column by column. 
It is harder to prove that the treewidth of H; is exactly t. Here Theorem [7.15] 
is very handy. The cross C;; of the grid is the union of the vertices of the 
i-th column and the j-th row. The set of crosses of H, forms a bramble of 
order t, hence the treewidth is at least t — 1. With a bit more work, one can 
construct a bramble of order t+ 1 in H;, and thus prove that tw(H;) = t; see 
Exercise [7.37] On the other hand, a grid is a planar graph, so by Kuratowski’s 
theorem, it does not admit a K5-minor. Therefore, K5-minor-free graphs can 
be of arbitrarily large treewidth. 

Our first attempt failed on the example of grids, so let us try to refine it by 
taking grid minors as obstacles, instead of clique minors. Since tw(H;) = t, 
a graph that contains a t x t grid as a minor must have treewidth at least 
t. What is more surprising, is that now the converse implication is true: 
there exists a function g such that every graph of treewidth larger than g(t) 
contains a t x t grid as a minor. 

This fundamental result was first proved by Robertson and Seymour. They 
did not give any explicit bounds on function g, but a bound following from 
their proof is astronomical. The first proof providing an explicit bound on 
g was later given by Robertson, Seymour, and Thomas; they showed that 
one can take g(t) = 20). There was a line of subsequent improvements 
decreasing the polynomial function in the exponent. However, whether g can 
be bounded by a polynomial function of t was open for many years. This open 
problem was resolved in 2013 by Chekuri and Chuzhoy in the affirmative. 
More precisely, they proved the following result. 

































































































































































Fig. 7.6: Example of a 6 x 6 grid He and a triangulated grid Iy 
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Theorem 7.22 (Excluded grid theorem, [73]). There exists a function 
g(t) = O(t98+°D) such that every graph of treewidth larger than g(t) contains 
4 as a minor. 




















Theorem gives the tightest relationship known so far between the 
treewidth and the size of a grid minor. It is worth remarking that its proof 
is constructive: there exists a randomized polynomial-time algorithm that 
either constructs a H; minor model, or finds a tree decomposition of the 
input graph of width at most g(t). 

Note that Theorem provides us the relationship between treewidth 
and packing cycles in a graph, which we discussed in the introduction of this 
section. Indeed, if a graph G contains a H, minor model for t = 2[vk], then 
in this model one can find ([Vk])? > k vertex-disjoint cycles: we just need 
to partition H; into 2 x 2 subgrids, and find a cycle in each of them. Hence, 
whenever a graph G has treewidth larger than g(t) = O(k*9+°)), then we 
are certain that one can pack k vertex-disjoint cycles in G. As a result, also 
for CYCLE PACKING we can apply the same win/win approach as described 
before. Namely, we run the approximation algorithm for parameter g(t). If 
tw(G) > g(t), then we conclude that (G, k) is a yes-instance. Otherwise, we 
obtain a tree decomposition of G of width at most 4g(t) + 4, on which we 
can employ dynamic programming (see Exercise {7.19}. This shows that the 
CYCLE PACKING problem is fixed-parameter tractable. 

If we assume that graph G is planar, then it is possible to get a much 
tighter relationship between the treewidth of G and the size of the largest 
grid minor that G contains. The following theorem is due to Robertson, 
Seymour and Thomas; we present here a version with refined constants due 
to Gu and Tamaki. 
























































Theorem 7.23 (Planar excluded grid theorem, [403]). Let t be 
a nonnegative integer. Then every planar graph G of treewidth at least 9t/2 
contains B; as a minor. Furthermore, for every e > 0 there exists an O(n) 
algorithm that, for a given n-vertez planar graph G and integer t, either 
outputs a tree decomposition of G of width at most (9/2 + £)t, or constructs 
a minor model of H; in G. 





















































In other words, for planar graphs the function g is linear. Since H, has 
t? vertices, every graph containing H; has at least t? vertices. Therefore, 
Theorem immediately implies the following corollary. 























Corollary 7.24. The treewidth of an n-vertez planar graph G is less than 
3 [Vn +1]. Moreover, for any £ > 0, a tree decomposition of G of width at 
most (2 +¢)|V/n+1] can be constructed in O(n) time. 


For us it is more important that the planar excluded grid theorem allow us 
to identify many parameterized problems on planar graphs with parameter 
k, such that the treewidth of the graph in every yes-instance/no-instance 
is O(Vk). For example, a quick reader probably can already see that. this 
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will be the case for the CYCLE PACKING problem. This requires inspecting 
the general win/win strategy for this problem that we explained before, and 
replacing function g provided by Theorem with the one provided by 
Theorem [7.23] We shall delve deeper into this direction in Section but 
now we proceed to one more variant of the excluded grid theorem. 

The variant we are about to introduce considers edge contractions instead 
of minors. For this, we shall define a new family of graphs which play the role 
of grids. For an integer t > 0, the graph I; is obtained from the grid H; by 
adding, for all 1 < x,y < t— 1, the edge (a +1, y), (x,y +1), and additionally 
making vertex (t,t) adjacent to all the other vertices (x, y) with x € {1,t} or 
y € {1,t}, i.e., to the whole border of H;. Graph T; is the graph in the right 


panel of Fig. 


Theorem 7.25 (Planar excluded grid theorem for edge contrac- 
tions). For every connected planar graph G and integer t > 0, if tw(G) > 
9t+5 then G contains I, as a contraction. Furthermore, for every e > 0 there 
exists an O(n?) algorithm that, given a connected planar n-vertex graph G 
and integer t, either outputs a tree decomposition of G of width (9+.¢)t+5 
or a set of edges whose contraction in G results in I. 






































Proof (sketch). By Theorem if the treewidth of G is at least 9t + 5, 
then G contains H2;;1; as a minor. This implies that, after a sequence of 
vertex deletions, edge deletions and edge contractions, G can be transformed 
to Ha41. Let us examine this sequence, omit all edge deletions, and replace 
every deletion of some vertex v with an edge contraction between v and one 
of its neighbors (such a neighbor exists by the connectivity of G). It is easy to 
see that this sequence of edge contractions transforms G into graph H which 
is a partially triangulated (2t + 1) x (2t + 1) grid, that is, a planar graph 
obtained from grid Ha:41 by adding some edges. We construct I; from H 
by contracting edges as shown in Fig. More precisely, we first contract 
the whole border of the grid so that it becomes one vertex adjacent to the 
whole border of the inner (2t — 1) x (2t — 1) subgrid. Then we contract this 
vertex onto the corner of the subgrid. At the end, we contract edges in the 
subgrid using the zig-zag pattern depicted in Fig. [7.7] to obtain the same 
diagonal in every cell. The final observation is that we must have obtained 
exactly the graph I, with no extra edges, since adding any edge to I; spoils 
its planarity—trecall that we started from a planar graph G and applied only 
edge contractions. 




































































7.7.2 Bidimensionality 


Introduction to bidimensionality. The planar excluded grid theorem 
(Theorem |7.23) provides a powerful tool for designing algorithms on planar 
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Fig. 7.7: The steps of the proof of Theorem The two first steps are the 
boundary contraction of a partial triangulation of Hg. The third step is the 
contraction to [4 




















graphs. In all these algorithms we use again the win/win approach. We first 
approximate the treewidth of a given planar graph. If the treewidth turns 
out to be small, we use dynamic programing to find a solution. Otherwise, 
we know that our graph contains a large grid as a minor, and using this we 
should be able to conclude the right answer to the instance. 

Let us first give a few examples of this strategy. In all these examples, by 
making use of the planar excluded grid theorem we obtain algorithms with 
running time subexponential in the parameter. 

Let us first look at PLANAR VERTEX COVER, i.e., for a given planar graph 
G and parameter k, we need to determine whether there exists a vertex cover 
of G of size at most k. We need to answer the following three simple questions. 























(i) How small can be a vertex cover of H;? It is easy to check that E 
contains a matching of size |¢?/2|, and hence every vertex cover of E 
is of cardinality at least |¢?/2|. 

(ii) Given a tree decomposition of width t of G, how fast can we solve 
VERTEX COVER? By Corollary [7.6] this can be done in time 2°. tO) -n. 

(iii) Is VERTEX COVER minor-closed? In other words, is it true that for 
every minor H of graph G, the vertex cover of H does not exceed the 
vertex cover of G? 


























As was observed in Chapter if the class of graphs with vertex cover 
at most k is minor-closed, i.e., a graph G has a vertex cover of size at most 
k, then the same holds for every minor of G. Thus, if G contains H; as a 
minor for some t > 2k + 2, then by G has no vertex cover of size k. By 
the planar excluded grid theorem, this means that the treewidth of a planar 
graph admitting a vertex cover of size k is smaller than 3 /2k + 2. 

We summarize the above discussion with the following algorithm. For 
t = [v2k +2] and some £ > 0, by making use of the constructive part 
of e either compute in time O(n?) a tree decomposition of G 
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of width at most (3 + )t, or we conclude that G has M; as a minor. In the 
second case, we infer that G has no vertex cover of size at most k. However, 
if a tree decomposition has been constructed, then by we can solve the 
problem in time 2(2 +2)[V2F42] . KOC) . n = 2O(V*) . n. The total running time 
of the algorithm is hence 20(V*) . n + O(n?). 

It is instructive to extract the properties of PLANAR VERTEX COVER 
which were essential for obtaining a subexponential parameterized algorithm. 

















(P1) The size of any solution in E is of order (2(t?). 

(P2) Given a tree decomposition of width t, the problem can be solved 
in time 20 . nO@), 

(P3) The problem is minor-monotone, i.e., if G has a solution of size 
at most k, then every minor of G also has a solution of size at most 


k. 

















In this argument we have used the fact that in H; the size of a mini- 
mum solution is lower-bounded by a quadratic function of t. We could apply 
the same strategy for maximization problems where H; admits a solution 
of quadratic size. For instance, consider the PLANAR LONGEST PATH and 
PLANAR LONGEST CYCLE problems: given a planar graph G and integer k, 
we are asked to determine whether G contains a path/cycle on at least k 
vertices. It is easy to see that H; contains a path on ¢? vertices and a cycle 
on t? — 1 vertices (or even t?, if t is even). Moreover, both these problems are 
minor-monotone in the following sense: if a graph H contains a path/cycle 
on at least k vertices, then so does every graph containing H as a minor. 
Finally, both these problems can be solved in time tO - n°“) when a tree 


decomposition of width t is available; see Exercise Consequently, we 
klogk) , ,O( 









































can apply the same win/win approach to obtain 20¢ 1) _time al- 
gorithms for PLANAR LONGEST PATH and PLANAR LONGEST CYCLE — the 
only difference is that when a large grid minor is found, we give a positive 
answer instead of negative. 

This suggests that properties (P1), (P2), and (P3) may constitute a base 
for a more general meta-result, which would apply to other problems as well. 
But before we formalize this idea, let us consider a few more examples. 

Our next problem is PLANAR DOMINATING SET. It is easy to check that 
the problem satisfies (P1), and we also proved that it satisfies (P2) (see The- 
orem [7-7p. However, (P3) does not hold. For example, to dominate vertices 
of a 3¢-vertex cycle C we need £ vertices. But if we add to C a universal 
vertex adjacent to all the vertices of C, then the new graph contains C as 
a minor, but has a dominating set of size 1. Therefore, the approach that 
worked for PLANAR VERTEX COVER cannot be applied directly to PLANAR 
DOMINATING SET. 

However, while not being closed under taking of minors, domination is 
closed under edge contractions. That is, if G can be dominated by k vertices, 
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then any graph obtained from G by contracting some of the edges can be 
also dominated by k vertices. What we need here is a structure similar to a 
grid to which every planar graph of large treewidth can be contracted. But 
this is exactly what Theorem [7.25] provides us! 

We thus need to answer the following question: what is the minimum 
possible size of a dominating set in I,? We do not know the exact formula (and 
this can be a difficult question to answer), but obtaining an asymptotic lower 
bound is easy. Every vertex except the lower-right corner (t,t) can dominate 
at most nine vertices. The lower-right corner vertex dominates only 4t — 4 
vertices. Thus, every dominating set of T; is of size at least Zita =R). 

In this manner, using Theorem we conclude that every connected 
planar graph with dominating set of size at most k has treewidth O(vk). 
We can now combine this observation with the dynamic programming of 
Theorem Using the same win/win approach, this gives us a QO(vk) . 
n©)-time algorithm for PLANAR DOMINATING SET. And again, the essential 
properties that we used are: (P1’) the size of a minimum solution in T; is of 
order (2(t?); (P2’) the problem can be solved in single-exponential time when 
parameterized by the treewidth of the input graph; and (P3’) the problem is 
closed under edge contraction (that is, the solution does not increase when 
edges are contracted). 


Formalizing the framework. We have already gathered enough intuition 
to be able to put the framework into more formal terms. In the following, we 
restrict our attention to vertex-subset problems. Edge-subset problems can 
be defined similarly and same arguments will work for them, hence we do not 
discuss them here. 

Let ¢ be a computable function which takes on input a graph G and a 
set S C V(G), and outputs true or false. The interpretation of ¢ is that it 
defines the space of feasible solutions S for a graph G, by returning a Boolean 
value denoting whether S is feasible or not. For example, for the DOMINATING 
SET problem we would have that ¢(G, S) = true if and only if N[S] = V (G). 
Similarly, for INDEPENDENT SET we would have that ¢(G, S) = true if and 
only if no two vertices of S are adjacent. 


For a function ġ, we define two parameterized problems, called vertex- 
subset problems: @-MINIMIZATION and @-MAXIMIZATION. In both 
problems the input consists of a graph G and a parameter k. @- 
MINIMIZATION asks whether there exists a set S C V(G) such that 
|S| < k and ¢(G,S) = true. Similarly, é- MAXIMIZATION asks whether 
there exists a set S C V(G) such that |S| > k and ¢(G,S) = true. 


Obviously, problems like DOMINATING SET, INDEPENDENT SET, or VER- 
TEX COVER are vertex-subset problems for appropriate functions ¢. We note, 
however, that this notion captures also many other problems that, at first 
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glance, have only little resemblance to the definition. A good example here is 
the familiar CYCLE PACKING problem. To see how CYCLE PACKING can be 
understood as a maximization vertex-subset problem, observe that a graph 
G contains k vertex-disjoint cycles if and only if there exists a set S C V(G) 
of size at least k with the following property: one can find in G a family 
of vertex-disjoint cycles such that each vertex of S is contained in exactly 
one of them. Hence, we can set ¢(G, S) to be true if and only if exactly this 
property is satisfied for G and S. Contrary to the examples of DOMINATING 
SET and INDEPENDENT SET, in this case it is NP-hard to check whether 
(G, S) is true for a given graph G and set S. But since we require ¢ only to 
be computable, this definition shows that CYCLE PACKING is a vertex-subset 
problem. Another examples are LONGEST PATH and LONGEST CYCLE; recall 
that in these problems we ask for the existence of a path/cycle on at least k 
vertices in a given graph. Here, we can take $(G,S) to be true if and only if 
G[S] contains a Hamiltonian path/cycle. 

Let us consider some ¢-MINIMIZATION problem Q. Observe that (G, k) E€ Q 
implies that (G, k’) € Q for all k’ > k. Similarly, if Q is a 6- MAXIMIZATION 
problem, then we have that (G, k) € Q implies that (G, k’) € Q for all k’ < k. 
Thus, the notion of optimality is well defined for vertex-subset problems. 


Definition 7.26. For a -MINIMIZATION problem Q, we define 
OPTg(G) = min{k : (G, k) € Q}. 


If there is no k such that (G, k) € Q, then we put OPTg(G) = +00. 
For a ¢-MAXIMIZATION problem Q, we define 


OPTg(G) = max{k : (G,k) € Q}. 
If there is no k such that (G, k) € Q, then we put OPTg(G) = —oo. 


We say that a (minimization or maximization) vertex-subset problem Q 
is contraction-closed if for every H that is a contraction of some graph G, 
we have that OPTg(H) < OPTg(G). Similarly, a vertex-subset problem Q 
is minor-closed if the same inequality holds for all minors H of G. Observe 
that the property of being contraction-closed or minor-closed can be checked 
by examining single graph operations for these containment notions: a con- 
traction of a single edge in the case of contractions, and vertex deletion, edge 
deletion, and edge contraction in the case of minors. 

We are now ready to define bidimensional problems. 


Definition 7.27 (Bidimensional problem). A vertex-subset problem Q 
is bidimensional if it is contraction-closed, and there exists a constant c > 0 
such that OPTg(I;) > ct? for every t > 0. 


As already pointed out in Definition |7.27| for the sake of simplicity from 
now on we will focus only on contraction bidimensionality. That is, we will 
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work with contraction-closed problems and graphs [;, rather than with 
minor-closed problems and grids H;. Being contraction-closed is a strictly 
weaker requirement than being minor-closed, whereas graphs I; and H; do 
not differ that much when it comes to arguing about lower/upper bounds 
on the optimum value for them. For this reason, on planar graphs working 
with contraction bidimensionality is usually the preferable way. We remark 
here that differences between contraction bidimensionality and minor bidi- 
mensionality become more significant when one generalizes the theory to 
H-minor-free graphs. We briefly discuss this topic at the end of this section. 

It is usually straightforward to determine whether a given problem Q 
is bidimensional according to Definition For example, let us take 
Q =VERTEX COVER. Contracting an edge does not increase the size of a 
minimum vertex cover, so the problem is contraction-closed. And as we al- 
ready observed, OPTg(Hz) > |t?/2|. Thus also OPTg(I;) > |t?/2], so 
VERTEX COVER is bidimensional. Similarly, FEEDBACK VERTEX SET, IN- 
DUCED MATCHING, CYCLE PACKING, SCATTERED SET for a fixed value of 
d, LONGEST PATH, DOMINATING SET, and r-CENTER are bidimensional as 
well; see Exercise [7.40] 

The crucial property that we used in our examples was that whenever the 
answer to the problem cannot be determined immediately, the input graph 
has treewidth roughly Vk. This is exactly what makes bidimensionality useful 
for algorithmic applications. 
























































Lemma 7.28 (Parameter-treewidth bound). Let Q be a bidimensional 
problem. Then there exists a constant ag such that for any connected planar 
graph G it holds that tw(G) < ag: .\/OPTQ(G). Furthermore, there exists a 


polynomial-time algorithm that for a given G constructs a tree decomposition 


of G of width at most ag - \/OPT9(G). 


Proof. Consider a bidimensional problem Q and a connected planar graph 
G. Let t be the maximum integer such that G contains I, as a contraction. 
Since Q is bidimensional, it is also contraction-closed, and hence OPTg(G) > 
OPT g(It) > ct?. By Theorem|7.25| the treewidth of G is at most 9t+5 < 14t. 


Thus OPTg(G) > c- mie and the first statement of the lemma follows 
for ag = 14/,\/c. To obtain the second, constructive statement, one can 
take ag = 15/\/c and apply the algorithm of Theorem [7.25] for € = 1 and 
increasing values of parameter t (starting from 1), up to the point when a 


tree decomposition is returned. 

















The next theorem follows almost directly from Lemma 


Theorem 7.29. Let Q be a bidimensional problem such that Q can be solved 
in time 201 .nO when a tree decomposition of the input graph G of width 
t is provided. Then Q is solvable in time 20K) 2°) on connected planar 
graphs. 
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Proof. The proof of the theorem is identical to the algorithms we described for 
VERTEX COVER and DOMINATING SET. Let (G, k) be an input instance of Q. 
Using Lemma we construct in polynomial time a tree decomposition G 
of width t < Se Pech If t > agVk, then we infer that k < OPT9(G). 
In such a situation we can immediately conclude that (G, k) is a no-instance 
of Q, provided Q is a minimization problem, or a yes-instance of Q, if Q is a 
maximization problem. Otherwise, we have in hand a tree decomposition of G 


of width at most agvk, and we can solve the problem in time QO(Vk) . nO) 
using the assumed treewidth-based algorithm. 














Observe that if the assumed algorithm working on a tree decomposition 
had a slightly different running time, say tO . n°, then we would need 
to make a respective adjustment in the running time of the algorithm given 
by Theorem [7.29] In the aforementioned case we would obtain an algorithm 
with running time KOVE) . nO, 

Let us remark that the requirement of connectivity of G in Theorem [7.29] 
is necessary. In Exercise [7.42] we give an example of a bidimensional problem 
Q such that Lemma [7.28]does not hold for Q and disconnected graphs. On 
the other hand, for most natural problems, in particular problems listed in 
Corollaries [7.30]and [7.31] we do not need the input graph G to be connected. 
This is due to the fact that each of these problems is monotone under removal 
of connected components. In other words, for every connected component C 
of G we have that OPTg(G — C) < OPT 9(G). It is easy to see that if 
problem Q has this property, then the requirement of connectivity can be 
lifted in Lemma [7.28]and Theorem [7.29 

We can now combine the facts that many considered problems are both 
bidimensional and solvable in single-exponential time when parameterized by 
treewidth, see Theorem [7.9|and Exercises [7.40|and [7.23] Thus, we obtain the 
following corollary of Theorem 


Corollary 7.30. The following parameterized problems can be solved in time 
2QO(VE) ROY on planar graphs: 


VERTEX COVER, 
INDEPENDENT SET, 
DOMINATING SET, 
SCATTERED SET for fixed d, 
INDUCED MATCHING, and 
r-CENTER for fixed r. 


Similarly, by combining Lemma with Theorem we obtain the 
following corollary. 


Corollary 7.31. The following parameterized problems can be solved in time 
OVE) nO) on planar graphs: 


e FEEDBACK VERTEX SET, 
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LONGEST PATH and LONGEST CYCLE, 
CYCLE PACKING, 

CONNECTED VERTEX COVER, 
CONNECTED DOMINATING SET, and 
CONNECTED FEEDBACK VERTEX SET. 


In Chapter [I1]|we will see how to improve the running time of dynamic pro- 
gramming on a tree decomposition to 20“ -n° for almost all the problems 
listed in Corollary [7.31] except CYCLE PACKING. Hence, for these problems 
the running time on planar graphs can be improved to QO(Vk) nO) as stated 
in Corollary [11.14] Actually, this can be done also for CYCLE PACKING, even 
though in Chapter [14] we will learn that the running time of t0 . n° on 
general graphs cannot be improved under reasonable complexity assumptions 
(see Theorem (14-19). We can namely use the fact that the algorithm is run 
on a planar graph of treewidth at most t, and this property can be exploited 
algorithmically via the technique of Catalan structures. This improvement, 
however, is beyond the scope of this book. 


Beyond planarity. Let us briefly mention possible extensions of bidimen- 
sionality to more general classes of graphs. The planar excluded grid theorem 
(Theorem|7.23) can be generalized to graphs excluding some fixed graph H as 
a minor, i.e., H-minor-free graphs. More precisely, Demaine and Hajiaghayi 
proved that for every fixed graph H and integer t > 0, every H-minor- 
free graph G of treewidth more than agt contains H; as a minor, where ag 
is a constant depending on H only. 

Using this, it is possible to show that the treewidth-parameter bound 
tw(G) < ag: y OPTgo(G) holds for much more general classes of apex-minor- 
free graphs. An apex graph is a graph obtained from a planar graph G by 
adding one vertex and making it adjacent to an arbitrary subset of vertices 
of G. Then a class of graphs is apex-minor-free if every graph in this class 
does not contain some fixed apex graph as a minor. Thus for example, the 
bidimensional arguments imply a subexponential parameterized algorithm 
for DOMINATING SET on apex-minor-free graphs, but do not imply such 
an algorithm for general H-minor-free graphs. While DOMINATING SET is 
solvable in subexponential parameterized time on H-minor-free graphs, the 
algorithm requires additional ideas. 

If we relax the notion of (contraction) bidimensionality to minor bidimen- 
sionality, i.e., we require that the problem Q is minor-closed and OPTg (Hi) = 
N(t?), then for minor-bidimensional problems the treewidth-parameter bound 
holds even for H-minor-free graphs. Therefore for example VERTEX COVER 
or FEEDBACK VERTEX SET admit subexponential parameterized algorithms 
on graphs excluding a fixed minor because they are minor-bidimensional. 
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7.7.38 Shifting technique 


We now present another technique for obtaining fixed-parameter tractable al- 
gorithms for problems on planar graphs using treewidth. The main idea of the 
approach originates in the work on approximation schemes on planar graphs, 
pioneered in the 1980s by Baker. The methodology is widely used in mod- 
ern approximation algorithms, and is called the shifting technique, or simply 
Baker’s technique. In this section we present a parameterized counterpart of 
this framework. 

For a vertex v of a graph G and integer r > 0, by Gi, we denote the 
subgraph of G induced by vertices within distance at most r from v in G. By 
Theorem we have the following corollary. 


Corollary 7.32. Let G be a planar graph, v be an arbitrary verter, and r be 
a nonnegative integer. Then tw(G) < 18r + 13. 


Proof. For the sake of contradiction, suppose that tw(G’,) > 18r + 14. Since 
G'i is connected and planar, by Theorem [7.25] we infer that G’) can be con- 
tracted to I>,+1. It is easy to see that for any vertex of [>,41, in particular 
for the vertex to which v was contracted, there is another vertex of I-41 at 
distance at least r + 1 from this vertex. Since contraction of edges does not 
increase the distances between vertices, this implies that there is a vertex of 
Gi, at distance at least r + 1 from v, which is a contradiction. 














It is possible to prove a better bound on the dependence of treewidth on r 
in planar graphs than the one stated in Corollary The following result 
is due to Robertson and Seymour. 


Theorem 7.33 ([398]). Let G be a planar graph, v be an arbitrary vertex, 
and r be a nonnegative integer. Then tw(G%) < 3r +1. Moreover, a tree 
decomposition of Gy, of width at most 3r +1 can be constructed in polynomial 
time. 


By Theorem we have the following corollary. 


Corollary 7.34. Let v be a vertex of a planar graph G, and for i > 0 let Li 
be the set of vertices of G that are at distance exactly i from v. Then for any 
i,j > 1, the treewidth of the subgraph Gi i+j-1 = GIL; U Lig, U- +4 U Li45-1] 
does not exceed 3j +1. Moreover, a tree decomposition of Gii4j;—-1 of width 
at most 37 + 1 can be computed in polynomial time. 


Proof. Consider the graph Gt+J—!, and note that G%~! is its induced sub- 
graph that is connected. Contract the whole subgraph G?~! to one vertex 
v*, and observe that the obtained planar graph H is exactly Gi i+j—1 with 
additional vertex v*. However, in H all the vertices are at distance at most j 
from v*, since in G every vertex from L; U Li41 U+- -U Li+j-1 is at distance at 
most j from some vertex of L;_1. By Theorem [7.33] we can compute in poly- 
nomial time a tree decomposition of H of width at most 3j + 1. As Gii4j-1 
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is an induced subgraph of H, we can easily adjust it to a tree decomposition 
of Gi i+j—1 of width at most 37 + 1. 














The basic idea behind the shifting technique is as follows. 


e Pick a vertex v of the input planar graph G, and run a breadth-first 
search (BFS) from v in G. 

e By Corollary for any choice of i,j > 1, the treewidth of the 
subgraph G;;4;~-1 induced by vertices on levels 7,4+1,...,i+j—1 
of BFS does not exceed 37 + 1. 

e Assume that in our problem we are looking for a set S of size k 
that has some property. Suppose for a moment that such S' exists. 
If we take 7 = k + 1, then we are sure that there exists a number 
q € {0,1,...,&} such that levels with indices of the form a(k+1)+¢ 
for a = 0,1,2,... do not contain any vertex of S. We can remove 
these layers from G, thus obtaining a graph G, that is a disjoint 
union of graphs G(k+1)a+q+1,(k+1)(a+1)+q—1 for a = 0,1,2,..., plus 
possibly G47! if q > 0. Note that G, still contains the whole set S. 
However, using Theorem [7.33]and Corollary [7.34] we can construct a 
tree decomposition of Gg of width at most 3k + 1. 

e To find a solution S, we iterate through all possible q € {0,1,...,k}, 
and for each q we run a dynamic-programming algorithm on the 
obtained tree decomposition of G,. If a solution exists, it survives in 
at least one graph Gq and can be uncovered by the algorithm. 





We formalize the above idea in the following lemma. 


Lemma 7.35. Let G be a planar graph and let k > 0 be an integer. Then the 
vertex set of G can be partitioned into k +1 sets (possibly empty) in such a 
manner that any k of these sets induce a graph of treewidth at most 3k + 1 
in G. Moreover, such a partition, together with tree decompositions of width 
at most 3k + 1 of respective graphs, can be found in polynomial time. 


Proof. Let us give a proof first for the case of connected graphs. Assume then 
that we are given a connected planar graph G and a nonnegative integer k. 
We select an arbitrary vertex v € V(G) and run a breadth-first. search (BFS) 
from v. For j € {0,...,k}, we define S; to be the set of vertices contained 
on levels a(k + 1) + j of the BFS, for a = 0,1,2,.... Observe that thus 
(Sj)o<j<k is a partition of V(G). Moreover, the graph G — S; is a disjoint 
union of graphs G(441)a+5+41,(k+1)(a+1)+i-1 for a = 0,1,2,..., plus possibly 
GJ! provided that j > 0. By Theorem (7.33]and Corollary [7.34] for each of 
these graphs we can construct a tree decomposition of width at most 3k + 1; 
connecting these tree decompositions arbitrarily yields a tree decomposition 
of G — S; of width at most 3k + 1. Hence, partition (5; )o<j;<, satisfies the 
required property and we have proved lemma for connected graphs. 
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Let G be a (non-connected) graph and let G1, G2, ..., Ge be its connected 
components . Then we apply the statement to each connected component G; 
separately, thus obtaining a partition (Si)o<j<x of V(G;) that satisfies the 
required property. Then for j € {0,1,...,k} take S; =Uj{_, 5%. It is easy to 
verify that partition (S;)o<;<x of V(G) satisfies the required property. 














Sometimes it is more convenient to use the following edge variant of 
Lemma The proof of the following statement can be proved in a similar 
manner as Lemma and we leave it as Exercise 


Lemma 7.36. Let G be a planar graph and k be a nonnegative integer. Then 
the edge set of G can be partitioned into k +1 sets (possibly empty) such that 
any k of these sets induce a graph of treewidth at most 3k+4 in G. Moreover, 
such a partition, together with tree decompositions of width at most 3k + 4 of 
respective graphs, can be found in polynomial time. 


We now give two examples of applications of the shifting technique. Our 
first example is SUBGRAPH ISOMORPHISM. In this problem we are given two 
graphs, host graph G and pattern graph H, and we ask whether H is iso- 
morphic to a subgraph of G. Since CLIQUE is a special case of SUBGRAPH 
ISOMORPHISM, the problem is W[1]-hard on general graphs when parame- 
terized by the size of the pattern graph. It is possible to show using color 
coding that the problem can be solved in time f(|V(H)|)|V(G)|C¢*™) for 
some function f, see Exercise[7.48] In other words, the problem is FPT if the 
pattern graph has constant treewidth. However, we now prove that if both 
the host and the pattern graph are planar, then the problem is FPT when 
parameterized by |V (H)|, without any further requirements. 

As a building block here we need the following result about solving SuB- 
GRAPH ISOMORPHISM on graphs of bounded treewidth. The lemma can be 
proved by either designing an explicit dynamic-programming algorithm, or 
using Courcelle’s theorem. We leave its proof as Exercise [7.49] 


Lemma 7.37. There exists an algorithm that solves a given instance (G, H) 
of SUBGRAPH ISOMORPHISM in time f(|V(H)|,t) -|V(G)|O™, where f is 
some function and t is the width of a given tree decomposition of G. 


Let us remark that the running time of the algorithm has to depend on 
|V (#)|; in other words, parameterization just by the treewidth will not work. 
It is in fact known that SUBGRAPH ISOMORPHISM is NP-hard on forests, i.e., 
graphs of treewidth 1. In bibliographic notes we discuss further literature on 
different parameterizations of SUBGRAPH ISOMORPHISM. 

Let us note that if we search for a connected subgraph H, then Lem- 
mas [7.33|and [7.37|are sufficient to find a solution: for each vertex v of a pla- 
nar graph, we try if a subgraph induced by the vertices of the ball of radius 
|V(H)| centered in v contains H as a subgraph. Because each such subgraph 
is of treewidth at most 3|V(H)|+1, this will give a time f(|V(H)|)-|V(G)|O 
algorithm solving SUBGRAPH ISOMORPHISM. However, if H is not connected, 
we need to use shifting technique. 
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Theorem 7.38. There exists an algorithm that, given an instance (G, H) of 
SUBGRAPH ISOMORPHISM where G and H are planar, solves this instance in 
time f(|V(H)|)-|V(G)|°™, for some function f. 


Proof. Let k = |V(#)|. Using Lemma|?.35} we construct in polynomial time a 
partition SoU- - -U Sp of V(G) such that for every i € {0,...,k}, graph G—S; 
has a tree decomposition of width at most 3k + 1. These tree decompositions 
are also provided by the algorithm of Lemma [7.35] 

By Lemma]7.37| we can solve SUBGRAPH ISOMORPHISM in time f(k) nO ® 
on each G — Sj. Because we partition the vertex set of G into k + 1 sets, for 
every k-vertex subset X of G there exists j € {0,...,k} such that XN S; = 0. 
Therefore, if G contains H as a subgraph, then for at least one value of j, 
G — Sj also contains H as a subgraph. This means that by trying each of the 
graphs G — S; for j € {0,...,k}, we will find a copy of H in G, provided 
there exists one. 














The running time of the algorithm of Theorem depends on how fast 
we can solve SUBGRAPH ISOMORPHISM on graphs of bounded treewidth. In 
bibliographic notes we sketch the current state of the art on this issue. 


Our second example concerns the MINIMUM BISECTION problem. For a 
given n-vertex graph G and integer k, the task is to decide whether there 
exists a partition of V(G) into sets A and B, such that |n/2| < |A|, |B| < 
[n/2] and the number of edges with one endpoint in A and the second in B 
is at most k. In other words, we are looking for a balanced partition (A, B) 
with an (A, B)-cut of size at most k. Such a partition (A, B) will be called a 
k-bisection of G. 

It will be convenient to work with a slightly more general variant of the 
problem. In the following, we will assume that G can be a multigraph, i.e., it 
can have multiple edges between the same pair of vertices. Moreover, the 
graph comes together with a weight function w : V(G) — Zso on ver- 
tices, and from a k-bisection (A, B) we will require that |w(V(G))/2| < 
w(A), w(B) < [w(V(G))/2]. Of course, by putting unit weights we arrive at 
the original problem. 

The goal is to prove that MINIMUM BISECTION on planar graphs is FPT 
when parameterized by k. This time we shall use yet another variant of 
Lemma [7.35] The proof of the following result is slightly more difficult than 
that of Lemma[7.35]or Lemma|[7.36] We leave it to the reader as Exercise[7.50 


Lemma 7.39. Let G be a planar graph and k be a nonnegative integer. Then 
the edge set of G can be partitioned into k+1 sets such that after contracting 
edges of any of these sets, the resulting graph admits a tree decomposition 
of width at most ck, for some constant c > 0. Moreover, such a partition, 
together with tree decompositions of width at most ck of respective graphs, 
can be found in polynomial time. 
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As a building block for FPT algorithm on planar graphs, we need the 
following lemma about fixed parameterized tractability of MINIMUM BISEC- 
TION on graphs of bounded treewidth. Its proof is again left to the reader as 
Exercise [7.51 


Lemma 7.40. MINIMUM BISECTION can be solved in time 2¢-W-n°™ on 
an n-vertex multigraph given together with its tree decomposition of width t. 
Here, W is the mazimum weight of a vertex. 


Note that here we are referring to tree decompositions of multigraphs, 
but the definitions of treewidth and tree decompositions can be naturally 
extended to multigraphs as well. We now proceed to the main result; note 
that again we state it for the generalized weighted problem. 


Theorem 7.41. MINIMUM BISECTION on planar graphs can be solved in 
time 20%) .W-n°, where W is the maximum weight of a vertex. 


Proof. We use Lemma [7.39] to partition the set of edges of the input planar 
graph G into sets So U: - -U Sp. Note here that Lemma [7.39]is formally stated 
only for simple graphs, but we may extend it to multigraphs by putting copies 
of the same edge always inside the same set S}. 

Suppose that there exists a k-bisection (A, B) of G, and let F be the set of 
edges between A and B. Then at least one of the sets S; is disjoint from F. Let 
us contract all the edges of S}, keeping multiple edges but removing created 
loops. Moreover, whenever we contract some edge uv, we define the weight of 
the resulting vertex as w(u)+w(v). Let G; be the obtained multigraph. Since 
F is disjoint from S}, during this contraction we could have just contracted 
some parts of G[A] and some parts of G[B]. Therefore, the new multigraph G; 
also admits a k-bisection (A’, B’), where A’, B’ comprise vertices originating 
in subsets of A and B, respectively. 

On the other hand, if for any G; we find some k-bisection (A’, B’), then 
uncontracting the edges of S; yields a k-bisection (A,B) of G. These two 
observations show that G admits a k-bisection if and only if at least one of the 
multigraphs G; does. However, Lemma|7.39}provided us a tree decomposition 
of each G4; of width O(k). Hence, we can apply the algorithm of Lemma|7.40 
to each Gj, and thus solve the input instance (G, k) in time 20k) . W . n07, 
Note here that the maximum weight of a vertex in each Gj is at most nW. 














Let us remark that the only properties of planar graphs that we used here 
were Theorem and the fact that planar graphs are closed under the 
operation of taking minors. We say that a class of graphs G is of bounded 
local treewidth if there exists function f such that for every graph G € G and 
every vertex v of G, it holds that tw(G7) < f(r). For example, Theorem [7.33] 
shows that planar graphs are of bounded local treewidth. Graphs of maximum 
degree d for a constant d are also of bounded local treewidth for the following 
reason: for every v and r, graph G7, contains at most d” + 1 vertices, so also 
tw(G3) < d”. However, this class is not closed under taking minors. 
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Our arguments used for planar graphs can be trivially extended to minor- 
closed graph classes of bounded local treewidth. It was shown by Eppstein 
that a minor-closed class of graphs is of bounded local treewidth if and 
only if all its members exclude some fixed apex graph as a minor. Recall 
that an apex graph is a graph obtained from a planar graph G by adding one 
vertex and making it adjacent to an arbitrary subset of vertices of G. Then a 
class of graphs is apex-minor-free if every graph in this class does not contain 
some fixed apex graph as a minor. Demaine and Hajiaghayi refined 
the result of Eppstein by showing that for every apex-minor-free graph class, 
function f in the definition of bounded local treewidth is actually linear. 
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In the PLANAR VERTEX DELETION problem we are given a graph G and 
an integer k. The question is whether there exists a vertex subset D of size 
at most k such that G — D is planar. A set D with this property will be 
henceforth called a planar deletion set. 

The class of planar graph is minor-closed and due to that PLANAR VER- 
TEX DELETION is a special case of G VERTEX DELETION for G the class of 
planar graphs. By Theorem [6.17] we have that PLANAR VERTEX DELETION 
is nonuniformly FPT. In this section we give a constructive FPT algorithm 
for PLANAR VERTEX DELETION. However, more interesting than the algo- 
rithm itself is the method of obtaining it. The algorithm is namely based on 
the irrelevant vertex technique. 

This technique originates from the work of Robertson and Seymour on 
their famous FPT algorithm for the VERTEX DISJOINT PATHS problem. In 
this problem, we are given a graph G and a set of k pairs of terminal vertices, 
{(s1,t1),---,(Sk,tz)}, and the task is to find k vertex-disjoint paths connect- 
ing all pairs of terminals. The algorithm of Robertson and Seymour is very 
complicated and uses deep structural theorems from the theory of Graph Mi- 
nors. On a very general level, the algorithm can be roughly summarized as 
follows. 

As long as the treewidth of G is large in terms of k, it is possible to find a 
vertex v that is solution-irrelevant: every collection of k paths of any solution 
can be rerouted to an equivalent one that avoids v. Thus v can be safely 
removed from the graph without changing the answer to the instance. By 
repeating this argument exhaustively, we eventually reduce the treewidth of 
the graph to some function of k. Then the problem can be solved in FPT 
time using dynamic programming. 

Since the work of Robertson and Seymour, the irrelevant vertex technique 
has been used multiple times in various parameterized algorithms. While the 
general win/win approach is natural, the devil lies in the detail. Most of 
the algorithms based on irrelevant edge/vertex techniques are very nontrivial 
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and involve a large number of technical details. The algorithm for PLANAR 
VERTEX DELETION is one of the simplest examples of applications of this 
approach, as it is possible to give an almost complete proof in just a few 
pages. Formally, we shall prove the following theorem. 


Theorem 7.42. PLANAR VERTEX DELETION can be solved in time 
gO (k? log k) ,O(1). 


The remaining part of this section is devoted to the proof of the theorem. 
We shall implement the following plan. 


e We use iterative compression (see Chapter [4) in a standard manner 
and reduce the problem to DISJOINT PLANAR VERTEX DELETION: 
For a given graph G and vertex set D of size k + 1 such that G — D 
is planar, we ask if there is a set D’ of size k such that G — D’ is 
planar and DN D' = Í. 

e The only black box ingredient which we do not explain is the fact that 
on graphs of treewidth t one can solve DISJOINT PLANAR VERTEX 
DELETION in time 20(¢!°2% ny using dynamic programming. 

e Ifthe treewidth of G—D is smaller than some threshold depending on 
k, then we solve DISJOINT PLANAR VERTEX DELETION by dynamic 
programming. Otherwise, G — D contains a large grid as a minor. 

e If the grid is sufficiently large, then it contains k + 2 “concentric” 
cycles “centered” at some vertex v. Moreover, the outermost cycle 
C separates all the other cycles from the rest of the graph, and the 
connected component of G — C containing all these cycles is planar. 
Then one can show that v is irrelevant and can be safely deleted from 
the graph; this is the most difficult part of the proof. 


We start by introducing a criterion of planarity that will be used later. Let 
C be a simple cycle in graph G. A C-bridge in G is a subgraph of G which 
is either a single edge that is a chord of C (together with its endpoints), or 
a connected component of G— V (C) together with all the edges connecting 
it with C (and their endpoints). Observe that if G is not connected, then 
by definition every connected component of G that does not contain C is 
a C-bridge. If B is a C-bridge, then the vertices of V(C) N V(B) are the 
attachment points of B. Two C-bridges B,, B2 overlap if at least one of the 
following conditions is satisfied: 


(a) Bı and Bə have at least three attachment points in common, or 

(b) cycle C contains distinct vertices a, b, c, d (in this cyclic order) such that 
a and c are attachment points of Bı, while b and d are attachment points 
of Bə. 


For a graph G with a cycle C, we define an overlap graph O(G, C) that has 
the C-bridges in G as vertices, and two C-bridges are connected by an edge 
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if they overlap. If G has a plane embedding, then in this embedding cycle 
C partitions the rest of the plane into two disjoint open sets, the faces of 
C. Observe that if two C-bridges Bı and Bz overlap, then they cannot be 
both drawn in the same face of C without intersection. This implies that if 
G is indeed planar, then the overlap graph O(G, C) has to be bipartite: sides 
of the bipartition of O(G, C) correspond to C-bridges that are placed inside 
and outside the cycle C. 

The following criterion of planarity states that the condition that O(G, C') 
is bipartite is not only necessary for G to be planar, but also sufficient. We 
leave its proof as Exercise [7.43] 


Lemma 7.43. Let C be a simple cycle in G. Then G is planar if and only if 
(i) for every C-bridge B, graph CU B is planar, and (ii) the overlap graph 
O(G,C) is bipartite. 


We now introduce the notation for separators in planar graphs. Let G 
be a connected graph, and let u,v be two distinct vertices. A set X is a 
(u, v)-separator if u,v ¢ X, but every u-v path contains a vertex of X. Note 
that in this definition we assume that the separator does not contain either 
u or v, contrary to separators that we used in Section The connected 
component of G—X that contains v is called the v-component, and its vertex 
set will be denoted by R(v, X). Equivalently, R(v, X) is the set of all the 
vertices of G that are reachable from v via paths that avoid X. We say that 
X is a connected separator if G[X] is connected, and is a cyclic separator if 
G[X] contains a Hamiltonian cycle. Of course, every cyclic separator is also 
connected. 

The following lemma will be a technical tool needed in delicate reasonings 
about finding an irrelevant vertex. 


Lemma 7.44. Let G be a connected graph, and suppose u,v € V(G) are two 
distinct vertices. Let Xı and Xa be two disjoint connected (u,v)-separators 
and let R; = R(v, X;) fori = 1,2. Then either R{UX, C Rə or RgUX2 C Ry. 


Proof. Let P be any walk in G that starts at v and ends in u. Since X; and 
Xə separate u from v, P meets both a vertex of X, and a vertex of Xə. Let 
hy(P) be the first vertex on P that belongs to X; U Xo, and let ho(P) be the 
last such vertex; in this definition P is traversed from v to u. 

We claim that if hi(P) € Xı then ho(P) € Xe, and if hi(P) € Xə then 
ho(P) € X1. These statements are symmetric, so let us prove only the first 
implication. Assume, for the sake of contradiction, that ho(P) € X, (recall 
that X, and X, are disjoint). Then consider the following v-u walk in G: we 
first traverse P from v to hı(P), then by connectivity of Xı we can travel 
through G[X,] to ha(P), and finally we traverse the end part of P from h2(P) 
to u. Since X; is disjoint from X2, by the definition of hi(P) and ha(P) we 
infer that this walk avoids X2. This is a contradiction with Xə being a (u, v)- 
separator. 
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Let us fix any v-u walk P and suppose that hy(P) € Xı and ho(P) € 
Xə. We will prove that in this case Rı UX, C Rg; the second case when 
hi(P) € X2 and h2(P) € X;, leads to the conclusion that Rə U X2 C R, in 
a symmetric manner. Take any other v-u walk P’. We claim that for P’ it 
also holds that hy(P’) € Xı and h2(P’) € X2. For the sake of contradiction, 
assume that hı(P') € X2 and ha(P') € X1. Consider the following v-u walk 
P": we first traverse P from v to hi(P), then by connectivity of Xı we can 
travel through G[X 1] to ho(P’), and finally we traverse the end part of P’ 
from h2(P’) to u. Again, by the same arguments as before walk P” avoids 
Xə, which is a contradiction to Xə being a (u, v)-separator. 

We now know that for every v-u walk P it holds that hi(P) € X; and 
ho(P) € X2. We shall prove that Rı C R2. Observe that this will prove also 
that Xı C Ro, since X; is connected, disjoint from X2, and adjacent to Rı 
via at least one edge due to the connectivity of G. Take any w € Rj; we 
need to prove that w € Rə as well. Let P, be any v-w walk that is entirely 
contained in G[R]. Prolong P,, to a v-u walk P’, by appending an arbitrarily 
chosen w-u walk. Observe now that no vertex of P, can belong to Xə — in 
such a situation we would have that h,(P/,) € X2, since no vertex of Py 
belongs to X,. Hence path P,, omits X>, which proves that w € Ro. As w 
was chosen arbitrarily, this concludes the proof. 














We are now ready to state the irrelevant vertex rule. 


Definition 7.45 (Irrelevant vertex). Let G bea graph and k be an integer. 
A vertex v of G is called an irrelevant vertex if for every vertex set D of size 
at most k, G — D is planar if and only if G — (DU {v}) is planar. 


Thus if v is an irrelevant vertex, then (G, k) is a yes-instance if and only if 
(G—v, k) is. The following lemma establishes a criterion of being an irrelevant 
vertex, and is the crucial ingredient of the algorithm. 


Lemma 7.46. Let (G,k) be an instance of PLANAR VERTEX DELETION. 
Suppose v is a vertex of G such that there exists a verter u # v and a 
sequence of pairwise disjoint cyclic (v,u)-separators X1, X2,..., Xk42 with 
the following properties: 


(a) for every i € {1,...,k +1}, X; is a subset of the R(v, Xi41), and 
(b) the graph G[R(v, Xk+2) U Xk4+2] is planar. 


Then v is an irrelevant vertex for the instance (G, k). 


Proof. Let v be a vertex of G that satisfies the conditions of the lemma. We 
show that for every set D of size at most k, G — D is planar if and only if 
G — (DU {v}) is planar. 

If G — D is planar then clearly G — (D U {v}) is planar. Suppose then 
that G — (DU {v}) is planar. By the conditions of the lemma, there exists a 
vertex u, u # v, and vertex-disjoint cyclic (v, u)-separators X1, X2,..., Xk+2 
satisfying properties (a) and (b). For i = 1,2,...,4 + 2, let Ri = R(v, X;). 
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Fig. 7.8: Situation in the proof of Lemma [7.46] Cycle C is depicted in blue. 
C-bridges B; in G—(DU {v}) that are originate in B, are depicted in yellow 
and red. The yellow C-bridge is the one that contains X;; it may not exist, but 
if it exists then it is unique. The other C-bridges (depicted red) are entirely 
contained in R;, and hence they do not have any attachment points on C 


Since separators X; are pairwise disjoint and connected, from Lemma 
and property we infer the following chain of inclusions: 


{v} CR, CRUX, C Ra C RQU XC... C Rupe C Rk+2 U Xk+2- 





As |D| < k, there are two separators X; and X; that contain no vertices 
from D. Assume without loss of generality that i < j; then we have also 
RiU X; C Rj. 

Let C be a Hamiltonian cycle in G[X,]. In order to show that G — D 
is planar, we want to apply the planarity criterion of Lemma [7.43] to cycle 
C and graph G — D. First of all, every C-bridge B in G — D that does not 
contain v is also a C-bridge in G—(DU{v}), and hence BUC is planar. Also, 
if B, is a C-bridge of G — D that contains v, then B, UC is a subgraph of 
G[R,; UX;], which in turn is a subgraph of the planar graph G|Rk+2 U Xk+2] 
(property (b)). Hence B, U C is also planar. 

Now we focus on the overlap graph O(G — D,C). Let B, be the C-bridge 
of G — D that contains v. Then B, = Bı U---U Bp U {v}, where Bq for 
q € {1,2,...,p} are some C-bridges in G — (DU {v}). Consider one B4. Since 


*7.8 Irrelevant vertex technique 221 


RUX; C Rj and R; is a connected component of G—X;, it follows that B4 can 
have a neighbor in X; only if B, contains some vertex of X;. Hence, every By 
that has an attachment point on C must intersect X;. However, separator X; 
is connected and disjoint from D, so there is a unique C-bridge in G—(DU{v}) 
that contains the whole X;. Therefore, we infer that among B1, ..., Bp there 
can be at most one C-bridge that actually has some attachment points on C— 
this is the C-bridge that contains X;—and all the other B,s are completely 
contained in R;. As these other C-bridges do not have attachment points, 
they are isolated vertices in O(G — (DU {v}), X). On the other hand, the C- 
bridge B, that contains X;, provided that it exists, has the same attachment 
points on C as C-bridge B, in G — D. 

Concluding, graph O(G — (DU {v}), X) can differ from O(G — D, X) only 
by having some additional isolated vertices, which correspond to C-bridges 
adjacent to v that are contained in R;. Because O(G — (D U {v}), X) is 
bipartite, we have that O(G — D, X) is also bipartite. Thus all the conditions 
required by Lemma [7.43] are satisfied, and we can conclude that G — D is 
planar. 














Now that we are armed with the irrelevant vertex rule, we can proceed to 
the algorithm itself. Essentially, the strategy is to try to find a situation where 
Lemmaj|7.46|can be applied, provided that the treewidth of the graph is large. 
Note that, due to property (b) from the statement of Lemma [7.46| we will 
need to identify a reasonably large part of the graph that is planar. Therefore, 
it would be perfect if we could work on a graph where a large planar part 
has been already identified. However, this is exactly the structure given to 
us as an input in the problem DISJOINT PLANAR VERTEX DELETION. That 
is, G — D is planar. Thus, to prove Theorem [7.42]it is sufficient to show the 
following lemma. 


Lemma 7.47. DISJOINT PLANAR VERTEX DELETION on an n-vertex graph 
can be solved in time 20% l8k)n°Q). Moreover, in the case of a positive 
answer, the algorithm can also return a planar deletion set of size at most k. 


Proof. The idea of the algorithm is as follows. Graph G — D is planar. Let 
us apply the algorithm of Theorem E graph G — D for £ = 4 and some 
parameter f(k) to be defined later. If we find a tree decomposition of G— D 
of width at most 5f(k), then we can create a tree decomposition of G of 
width at most 5f(k)+|D| < 5f(k)+ k+ 1 by simply adding D to every bag. 
In this situation we can use dynamic programming to solve the problem in 
FPT time. Otherwise, if the algorithm of Theorem [7.23] fails to construct a 
tree decomposition, then it returns an f(k) x f(k) grid minor in G— D. We 
will then show that G contains an irrelevant vertex which can be identified 
in polynomial time. Hence, we can identify and delete irrelevant vertices up 
to the point when the problem can be solved by dynamic programming on a 
tree decomposition of bounded width. 

It is not difficult to show that PLANAR VERTEX DELETION admits an 
FPT dynamic-programming algorithm when parameterized by the width of 
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a given tree decomposition of the input graph. For example, one can make 
use of the optimization version of Courcelle’s theorem (Theorem (7-12): we 
write an MSO» formula y(D) for a free vertex set variable D, which ver- 
ifies that G — D is planar by checking that it contains neither a K5-minor 
nor a K3 3-minor. Then minimizing |D| corresponds to finding the minimum 
possible size of a planar deletion set in G. In this argument we can also eas- 
ily handle annotations in DISJOINT PLANAR VERTEX DELETION, since we 
may additionally require in the formula that D has to be disjoint from the 
set of forbidden vertices. This gives some FPT algorithm for DISJOINT PLA- 
NAR VERTEX DELETION. However, obtaining explicit and efficient bounds on 
the running time requires additional ideas and more technical effort. In our 
algorithm we shall use the following result, which we do not prove here. 


Lemma 7.48 ([277]). DISJOINT PLANAR VERTEX DELETION can be solved 
in time 20°84 7 on an n-vertex graph given together with its tree decom- 
position of width at most t. Moreover, in the case of a positive answer, the 
algorithm can also return a planar deletion set D that has size at most k and 
is disjoint from the forbidden vertices. 


We have already all the tools to deal with the bounded-treewidth case, so 
now let us try to find a place to apply Lemma [7.46]if the treewidth is large. 
Let us fix p = 2k +5 and f(k) = (p+4)(k+2), and let H be the f(k) x f(k) 
grid. Recall that if the algorithm of Theorem [7.23] fails to construct a tree 
decomposition of small width of G — D, it provides a minor model of H in 
G — D. Let this minor model be (Iw) weve); recall that every branch set Jw 
is connected and all the branch sets are pairwise disjoint. Moreover, let Gy 
be the connected component of G — D that contains H. From the graph G 
we construct a new graph Ge by performing the following steps: 


First, delete all the vertices of G — D apart from the component Gy. 
Then, for every w € V(#) contract Gy[I,,] into a single vertex, which will 
be called n(w). 

e Iteratively take a vertex u € V(Gz) that does not belong to any branch 
set Iņ, and contract u onto any of its neighbor (such a neighbor exists due 
to connectivity of Gy). Perform this operation up to the point when there 
are no more original vertices of V (Gp) left. 


In this manner, V(G,) consists of set D and set {n(w) : w € V(H)} that 
induces in G, a graph H that is a supergraph of grid H. In other words, Ge 
consists of a partially triangulated f(k) x f(k) grid H and a set of apices D 
that can have quite arbitrary neighborhoods in H. For w€ V(H), let Jy, be 
the set of vertices of Gy that got contracted onto w. Note that Iw C Jw, 
G'|Jw] is connected, and (Jw)wev(H) forms a partition of V (Gp). 

After these operations we more-or-less see a large, flat area (planar) in the 
graph, which could possibly serve as a place where Lemma [7.46] is applied. 
However, we do not control the interaction between D and H. The goal now 
is to show that this interaction cannot be too complicated, and in fact there 
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Fig. 7.9: Choosing the family of subgrids P in a large grid H. The grids of 
P are depicted in dark grey, and their 1-frames and 2-frames are depicted in 
light grey 





exists a large enough subgrid of H that is completely nonadjacent to D. From 
now on, by somewhat abusing the notation we will identify grid H with its 
subgraph in H via mapping n; note that thus V(H) = V(#), and H can only 
have some additional edges. 

Let us partition grid H into smaller grids. Since H is an f(k) x f(k) grid 
for (p+4)(k+2), we can partition it into (k+2)? grids of size (p+4) x (p+4). 
Now for each of these subgrids perform twice the following operation: delete 
the whole boundary of the grid, thus “peeling” twice a frame around the grid. 
If M is the obtained px p subgrid, then by 2-frame and 1-frame of M, denoted 
by F5(M) and Fı(M), we mean the boundaries removed in the first and in 
the second step, respectively. Thus, the 2-frame of M induces in H a cycle of 
length 4(p +3) around M, and the 1-frame induces a cycle of length 4(p + 1) 
around M. See Fig. [7.9|for a visualization of this operation. Let P be the set 
of obtained p x p grids M (i.e., those after peeling the frames). 

We now prove the following claim, which bounds the interaction between 
a single vertex of D and the grids from P. 


Claim 7.49. If (G,k,D) is a yes-instance of DISJOINT PLANAR VERTEX 
DELETION, then in graph Ge no vertex of D can have neighbors in more than 
k+1 grids from P. 
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Proof. For the sake of contradiction, assume that there is a set D’ C V(G)\D 
such that |D’| < k and G—D’ is planar, and suppose that there exists a vertex 
x € D that in Ge has neighbors in k+2 of px p grids from P. We now examine 
what happens with set D’ during the operations that transform G to Ge. Let 
D. be the set of those vertices of Ge onto which at least one vertex of D’ 
was contracted. In other words, whenever we contract an edge, we consider 
the resulting vertex to be in D’ if and only if one of the original endpoints 
was in D’, and we define D, to be the set of vertices of Ge that end up being 
in D’ after all these contractions. It follows that |De] < |D’| < k, De is still 
disjoint from D, and Ge — De is a planar graph. 

Since D, is of size at most k, there exist at least two p x p subgrids from 
P, say Y and Z, which are adjacent to x in Ge and such that De does not 
contain any vertex of Y, or of Z, or of 1- and 2-frames of Y and Z. Let y be 
an arbitrary neighbor of x in Y, and similarly define z € Z. 

It is easy to see that for any two distinct p x p grids of P, there are at 
least p > k vertex-disjoint paths in H that connect these grids. One of these 
paths does not contain any vertex from D,. Thus, in H there exists a path P 
from y to z that contains no vertex of De: we first travel inside Y from y to 
the border of Y, then we use a D,-free path connecting Y and Z, and finally 
we get to z by travelling inside Z. 

Let us stop for a moment and examine the current situation (see also 
Fig. [7.10}. We have identified two subgrids Y and Z that are completely free 
from De together with their frames. Moreover, these two subgrids can be 
connected inside H by a path that is also free from D,. So far the picture is 
planar, but recall that we have an additional vertex x that is adjacent both 
to y € Y and to z € Z. This vertex thus creates a “handle” that connects 
two completely different regions of grid H. As we assumed that x € D and 
D. is disjoint from D, we have that x De, and hence this additional handle 
is also disjoint from De. Now the intuition is that the obtained structure 
is inherently non-planar, and therefore it cannot be contained in the planar 
graph Ge — De. To prove this formally, the easiest way is to exhibit a model 
of a Ks5-minor in Ge that does not use any vertex of De. The construction is 
depicted in Fig. [7.10 

We start with constructing a minor of K without one edge in YU F\(Y)U 
F(Y). Recall that x is adjacent to y € Y and z € Z. Define the following 
branch sets J1, [o,..., I5. First, take I, = {y}. Then, it is easy to see that one 
can partition (Y U Fi(Y)) \ {y} into three connected sets I2, I3, I4 such that 
each of them is adjacent both to y and to F2(Y). Finally, take I5 = F> (Y). 
Thus all the pairs of branch sets are adjacent, apart from J, and I5. To 
create this additional adjacency, we use vertex x. More precisely, we extend 
the branch set Jı by including x, z, and all the vertices from the path P 
traversed from z up to the point when it meets F2(Y). In this manner, branch 
set J, is still connected, and it becomes adjacent to I5. In this construction we 
used only vertices that do not belong to De, which means that J), Io,..., Is 
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Fig. 7.10: Construction of a Ks-minor using x and adjacent subgrids Y and 
Z. Path P connecting y and z is depicted in blue. 




















form a Ks5-minor model in Ge — De. This is a contradiction with De being a 
planar deletion set for Ge. di 


By Claim|7.49| in graph Ge every vertex of D has neighbors in at most k+1 
subgrids from P, or otherwise we may immediately terminate the subproblem 
by providing a negative answer. Since |P| = (k + 2)? and |D| < k +1, this 
means that we can find at least one p x p subgrid M € P that has no 
neighbors in D. Let ve be the middle vertex of M, and let v be any vertex 
of Jy,- We claim that v is irrelevant, and to prove this we will need to verify 
the prerequisites of Lemma [7.46] 

Since p = 2k + 5, we can find k + 2 vertex sets N1, No,...,Nr+2 C V(M) 
such that N; induces the i-th cycle in M around ve, counting from ve. In other 
words, sets Nk+2, Neii,..-, Ni are obtained from M by iteratively taking the 
boundary of the grid as the next N;, and removing it, up to the point when 
only ve is left. If we now uncontract the vertices of N;, then we obtain a 
connected set Y; = Unevcn,) Jw- For each i € {1,2,...,k + 2}, let C; bea 
cycle inside G’[Y;] that visits consecutive branch sets J, for w € V(N;) in 
the same order as the cycle induced by N; in M. Let X; = V(C;), and let us 
fix any vertex u € V (Ga) \ Uwevcmy Jw; such a vertex exists since M is not 
the only grid from P. 


Claim 7.50. Vertices v and u and sets X1, X2, ..., Xg+2 satisfy the prerequi- 
sites of Lemma for the instance (G,k) of PLANAR VERTEX DELETION. 
Consequently, vertex v is irrelevant for (G, k). 
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and Pw certify that v and y have to be placed on different faces of C; in any 
planar drawing of Gy 






Proof. The two prerequisites that are trivial are that each X; induces a graph 
with a Hamiltonian cycle, and that X; C R(v, Xi+1). The first follows directly 
from the definition, and the second follows from the existence of connections 
between branch sets (Jw)wev(H) implied by M. 

Therefore, we need to prove that (i) each X; is indeed a (u, v)-separator, 
and (ii) G'[Xk+2 U R(v, Xę+2)| is planar. For i € {0,1,2,...,k + 2}, let 
N<i = {v} UN, UN. U... U N; and let W; be the set of vertices of Gy 
contained in branch sets Jy for w € N<;. To prove (i) and (ii), we shall 
show that for each i > 1, it holds that R(v,X;) C Wi. In other words, all 
the vertices that can be connected to v via paths avoiding X; are actually 
contained in Gz, and moreover get contracted onto vertices contained in the 
union of the deepest i layers in M. Note that this statement implies (i), since 
u ¢ Wk+2, and also it implies (ii), since then G[X;42 U R(v, Xp42)] is an 
induced subgraph of the planar graph G — D. 

Consider any y € V (Ga) \ Wi; then in Ge vertex y gets contracted onto 
some wy ¢ N<i. Choose four vertices wy, wg, ws and ww, placed in the 
interiors of the north, east, south, and west side of the cycle M[N;], respec- 
tively; see Fig. Since M does not touch the boundary of H (it has at 
least two frames peeled), it can be easily seen that in Gy we can find four 
paths Py, Pz, Ps and Pw such that the following holds: 
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e Py and Pg connect y with the cycle C; such that both Pg and Pw travel 
only through branch sets J, for w € N<;, apart from Jwp in the case of 

Pr, and Jww in the case of Pw. Then Pg meets C; inside Jy, and Pw 
meets Ci inside Jww- 

e Py and Ps connect v with the cycle C; such that both Py and Ps travel 
only through branch sets Jų for w € N<j-1, apart from J, in the case 
of Py, and Jws in the case of Py. Then Py meets C; inside Jwy and Pg 
meets C; inside Jwg- 


Note that the family of branch sets visited by Pw U Pg is disjoint from the 
family of branch sets visited by Py U Ps. Hence, subgraphs Pw U Pr and 
Py U Pg are vertex-disjoint. 

Consider now subgraph Gp = Py U Pg U Ps U Pw UC; of Gy. Gp is 
obviously planar as a subgraph of a planar graph, and it has two C;-bridges: 
the first Py U Ps contains v, and the second PgU Pw contains y. Each of these 
C;-bridges has two attachment points, and all these four attachment points 
interleave. It follows that for every planar embedding of Gp, vertices v and 
y must be drawn on different faces of the cycle C;. Since Gp is a subgraph 
of Gy, the same holds for Gy as well. 

We are now ready to prove that R(v, X;) C Wi. For the sake of contradic- 
tion, suppose that some vertex y ¢ W; can be reached from v via a path P 
that avoids X;. By taking the first vertex outside W; on P, we may assume 
without loss of generality that all the other vertices on P belong to W;. Let 
y’ be the predecessor of y on P; then y’ € W;. Now examine where y can be 
localized in graph G. It cannot belong to D, since we know that in Ge all the 
vertices of D are not adjacent to any vertex of M, and in particular not to 
the vertex onto which y’ was contracted. Also, obviously it cannot belong to 
a different component of G—D than Gy, since y’ E€ W; C V (Gu). Therefore, 
we are left with the case when y € V(Gy) \ W;. But then we know that v 
and y must be drawn on different faces of C;, so every path contained in Gy 
that connects v and y has to intersect X;. This should in particular hold for 
P, a contradiction. 4 


By Claim [7.50] we conclude that vertex v is irrelevant, so we can safely 
delete it and proceed. Note that by the definition of being irrelevant, removal 
of v is also safe in the disjoint variant of the problem that we are currently 
solving. 

The reader may wonder now why we gave so elaborate a proof of Claim[7.50] 
even though its correctness is “obvious from the picture”. Well, as the proof 
shows, formal justification of this fact is far from being trivial, and requires 
a lot of attention to avoid hand-waving. This is a very common phenomenon 
when working with planar graphs: facts that seem straightforward in a pic- 
ture, turn out to be troublesome, or even incorrect, when a formal argument 
needs to be given. A good approach here is to try to identify the key formal 
argument that exploits planarity. In our case this argument was the observa- 
tion that in Gy, set X; has to separate v from every vertex y € V (Gy) \ Wi, 
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because v and y have to lie on different faces of C; in every planar embedding 
of Gy. Once the key argument is identified and understood, the rest of the 
proof follows easily. 

To summarize, we solve DISJOINT PLANAR VERTEX DELETION with the 
following algorithm. 


e In polynomial time we are able either to conclude that this is a no-instance, 
or to identify an irrelevant vertex and delete it. We repeat this procedure 
until the treewidth of G— D becomes at most 5f(k), and the corresponding 
tree decomposition can be constructed. 

e When we have at hand a tree decomposition of G — D of width at most 
5f(k), we use Lemma|7.48|to solve the subproblem in time 20(/(*) log £4) n, 


Since f(k) = O(k?), the total running time of our algorithm solving DISJOINT 
PLANAR VERTEX DELETION is 200 (%) log f(K)) . nOA) = 20k? log k) pOl), 














As discussed before, Lemma [7.47] with a standard application of the iter- 
ative compression technique (see Section [4.1.1) concludes the proof of Theo- 
rem [7.42] We remark that with more ideas, the running time of the algorithm 
can be improved; see the bibliographic notes. 


7.9 Beyond treewidth 


Given all the presented applications of treewidth, it is natural to ask if one 
could design other measures of structural complexity of graphs that would be 
equally useful from the algorithmic viewpoint. So far it seems that treewidth 
and pathwidth are the most successful concepts, but several other approaches 
also turned out to be fruitful in different settings. In this section we review 
briefly two examples of other width measures: branchwidth and rankwidth. 
While the first one is tightly related to treewidth, the second one goes beyond 
it. 


Branch decompositions and f-width. It will be convenient to introduce a 
more generic approach to defining structural width measures of combinatorial 
objects. Assume we are working with some finite universe U, and suppose we 
are given a function f defined on subsets of U that, for a subset X C U, is 
supposed to measure the complexity of interaction between X and U \ X. 
More precisely, f: 27 > R>o is a function that satisfies the following two 
conditions: 


e Symmetry: for each X C U, it holds that f(X) = f(U \ X); 
e Fairness: f(Q) = f(U) =0. 
Function f will be also called a cut function, or a connectivity function. 


Given some cut function f on U, we may define a class of structural de- 
compositions of U. A branch decomposition of U is a pair (T, n), where T is a 
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tree with |U| leaves whose all internal nodes have degree 3, and ņ is a bijec- 
tion from U to the leaves of T. Consider any edge e € E(T). If we remove e 
from T, then T gets split into two subtrees, and thus the set of leaves of T is 
also partitioned into two sets. Leaves of T correspond one-to-one to elements 
of U, so let (Xe, Ye) be the corresponding partition of U. We define the width 
of edge e in (T, n) as f(X-) = f (Ye); this equality holds due to the symmetry 
condition on f. The width of (T,7) is the maximum width among its edges. 
The f-width of set U, denoted by wy(U), is equal to the minimum possible 
width of its branch decomposition. If |U| = 1 then there is no decomposition, 
so we put wy(U) = 0. 

This definition is generic in the sense that we may consider different com- 
binatorial objects U and various cut functions f, thus obtaining a full vari- 
ety of possible structural parameters. Both branchwidth and rankwidth are 
defined as f-widths for appropriately chosen U and f. However, the defi- 
nition of f-width can be used to define structural width parameters of not 
only graphs, but also hypergraphs and matroids. Also, one often assumes 
that the cut function f is additionally submodular, i.e., it satisfies property 
F(X) + fY) = f(X UY) + f(X NY) for all X,Y C U. This property pro- 
vides powerful tools for approximating the f-width of a given set. In the 
bibliographic notes we give some pointers to literature on this subject. 


Branchwidth. To define branchwidth of a graph G, we put U = E(G), the 
edge set of G. For every X C E(G), its border 6(X) is the set of those vertices 
of G that are incident both to an edge of X and to an edge of E(G) \ X. The 
branchwidth of G, denoted by bw(G), is the f-width of U = E(G) with cut 
function f(X) = |6(X)|. 

It turns out that branchwidth is in some sense an equivalent definition 
of treewidth, expressed in the formalism of branch decompositions. More 
precisely, for every graph G with bw(G) > 1 it holds that 


bw(G) <tw(G) +1< 5 bw(G). (7.13) 


In many situations branchwidth turns out to be more convenient to 
work with than treewidth, for example when it comes to some dynamic- 
programming algorithms. More importantly, branchwidth seems to behave 
more robustly on planar graphs. In particular, branchwidth of a planar graph 
can be computed in polynomial time, whereas the computational complexity 
of treewidth on planar graphs remains a longstanding open question. Also, 
in the proof of the Planar Excluded Grid Theorem (Theorem]7.23) it is more 
convenient to relate the size of the largest grid minor in a graph to its branch- 
width rather than to treewidth, for instance when one tries to refine/improve 
constants. Actually, the version of Gu and Tamaki that we gave here was 
originally stated for branchwidth, and then the bound was translated for 


treewidth using (7.13). 
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It is worth mentioning that branchwidth does not use the notion of vertices 
and hence generalizes to matroids while treewidth is tied to vertices. 


Rankwidth. The motivation of rankwidth comes from the observation that 
many computational problems are tractable on classes of graphs that are 
dense, but structured. An obvious example of such graphs are complete 
graphs, but one can also allow a little bit more freedom in the structure. 
For instance, take the class of cographs, that is, graphs that can be con- 
structed from single vertices using two types of operations: (a) taking a dis- 
joint union of two constructed cographs, and (b) taking a disjoint union of 
two constructed cographs and adding a complete bipartite graph between 
their vertex sets. 

This recursive definition of cographs allows us to solve many computational 
problems using, again, the principle of dynamic programming. Essentially, 
we exploit the fact that the algebraic structure of cographs is simple — the 
adjacency relation between pieces in their decomposition is trivial: full or 
empty. However, cographs are dense in the sense that the number of edges 
may be quadratic in the number of vertices. Hence, their treewidth can be 
even linear, and all the tools that we developed in this chapter may not be 
applicable at all. Therefore, it is natural to introduce a new type of structural 
decomposition, where the main measure of width would not be the cardinality 
of a separator, but rather the complexity of the adjacency relation between 
a part of the decomposition and the rest of the graph. 

Historically, the first width parameter whose goal was to capture this phe- 
nomenon was cliquewidth. We do not give here its formal definition due to its 
technicality. Intuitively, a graph is of cliquewidth at most & if it can be built 
from single vertices by consecutively joining already constructed parts of the 
graph, and in each constructed part the vertices can be partitioned into at 
most k types such that vertices of the same type will be indistinguishable 
in later steps of the construction. The combinatorics of cliquewidth, while 
convenient for designing dynamic-programming algorithms, turned out to be 
very difficult to handle from the point of view of computing or approximat- 
ing this graph parameter. For this reason, rankwidth has been introduced. 
Rankwidth is equivalent to cliquewidth in the sense that its value is both 
lower- and upper-bounded by some functions of cliquewidth. However, for 
rankwidth it is much easier to design exact and approximation algorithms. 

We now proceed to formal definitions. For a vertex set X C V(G), we 
define its cut-rank pg(X) as follows. Consider an |X| x |V(G) \ X| ma- 
trix Be(X) = (be,y)xex,yev(G)\x With rows indexed by vertices of X and 
columns indexed by vertices of V (G) \ X. Entry bz, y is equal to 1 if z and y 
are adjacent, and 0 otherwise. Thus, Bg(X) is exactly the adjacency matrix 
of the bipartite graph induced by G between X and V(G) \ X. We define 
pa(X) to be the rank of Be(X), when treated as a matrix over the binary 
field GF(2). Then the rankwidth of G, denoted by rw(G), is equal to the 
pa-width of V(G). 
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It is easy to prove that for any graph G it holds that rw(G) < tw(G), see 
Exercise However, as expected, treewidth cannot be bounded by any 
function of rankwidth. To see this, observe that the rankwidth of a complete 
graph on n vertices is at most 1, while its treewidth is n— 1. Hence, classes of 
graphs that have bounded treewidth have also bounded rankwidth, but not 
vice versa. 

Designing dynamic-programming routines for graphs of bounded rankwidth 
are similar to those of treewidth. Instead of exploiting the property that the 
already processed part of the graph communicates with the rest via a small 
separator, we use the fact that the matrix encoding adjacency relation be- 
tween the processed part and the rest has small rank. This enables us to 
partition the processed vertices into a bounded number of classes that have 
exactly the same neighborhood in the rest of the graph. For some problems, 
vertices of the same type can be treated as indistinguishable, which leads to 
reducing the number of states of the dynamic program to a function of k. 

As in the case of treewidth, dynamic-programming algorithms on graphs 
of bounded rankwidth can be explained by a meta-theorem similar to Cour- 
celle’s theorem. For this, we need to weaken the MSO» logic that we de- 
scribed in Section [7.4.1] to MSO, logic, where we forbid quantifying over 
subsets of edges of the graph. Then, analogues of Theorems [7.11] and 
hold for MSO, and graphs of bounded rankwidth. It should not be very sur- 
prising that rankwidth, as a parameter upper-bounded by treewidth, supports 
fixed-parameter tractability of a smaller class of problems than treewidth. In- 
deed, some problems whose fixed-parameter tractability when parameterized 
by treewidth follows from Theorem [7.12] turn out to be W{1]-hard when pa- 
rameterized by rankwidth, and hence unlikely to be FPT. Examples include 
HAMILTONIAN CYCLE and EDGE DOMINATING SET; again, it is not a coin- 
cidence that these are problems where quantification over a subset of edges 
is essential. 

When it comes to the complexity of computing rankwidth, the situation 
is very similar to treewidth. Hlinény and Oum gave an f(k) - nOW- 
time algorithm computing a branch decomposition of rankwidth at most k, 
or correctly reporting that the rankwidth of a given graph is more than k. 
An approximation algorithm with factor 3 for rankwidth, running in time 
20(%) «nO, was given by Oum [B77]. 


Exercises 


7.1. Prove Lemma [7.2] 
7.2. Prove Lemma 


7.3 (®,). Find an example showing that the bound O(k|V(G)|) on the number of nodes 
of a nice tree decomposition cannot be strengthened to O(|V(G))). 
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7.4. Let T = (T,{Xt}trevcr)) be a tree decomposition of graph G, t be a node of T, and 
Xt be the corresponding bag. Show that for every connected component C of G — Xz, the 
vertices of C are contained in bags of exactly one of the connected components of T — t. 


7.5. Prove Lemma 


7.6. Prove that every clique of a graph is contained in some bag of its tree decomposition. 
Infer that tw(G) > w(G) — 1, where w(G) denotes the maximum size of a clique in G. 


7.7 (&). Show that treewidth is a minor-monotone parameter, i.e., for every minor H of 
a graph G, tw(H) < tw(G). 


7.8 (&). What is the treewidth of (a) a complete graph; (b) a complete bipartite graph; 
(c) a forest; (d) a cycle? 


7.9 (&). Show that the treewidth of a graph G is equal to the maximum treewidth of its 
biconnected components. 


7.10 (È). Show that the pathwidth of an n-vertex tree is at most [logn]. Construct a 
class of trees of pathwidth k and O(3*) vertices. 


7.11 (%). Prove that a graph has treewidth at most 2 if and only if it does not contain 
K4 as a minor. 


7.12. A graph is outerplanar if it can be embedded in the plane in such manner that all 
its vertices are on one face. What values can treewidth of an outerplanar graph have? 


7.13 (Æ). Prove that the treewidth of a simple graph cannot increase after subdividing 
any of its edges. Show that in the case of multigraphs the same holds, with the exception 
that the treewidth can possibly increase from 1 to 2. 


7.14. A graph G is called d-degenerate if every subgraph of G contains a vertex of degree 
at most d. Prove that graphs of treewidth k are k-degenerate. 


7.15. Let G be an n-vertex graph of treewidth at most k. Prove that the number of edges 
in G is at most kn. 


7.16 (&). For a graph G given together with its tree decomposition of width t, construct 
in time 0“) n a data structure such that for any two vertices x,y € V(G), it is possible 
to check in time O(t) if x and y are adjacent. You should not use any results on hashing, 
hash tables, etc. 


7.17 (Æ). Remodel the dynamic-programming algorithm of Section so that it uses 
the following definition of the value of a state: for Y C X+, value cit, Y] is equal to the 
maximum possible weight of an independent set in G[V; \ Y]. How complicated are the 
recursive formulas, compared to the algorithm given in Section [7.3.1f 


7.18. Construct the remaining algorithms listed in Theorem|7.9| That is, show algorithms 
that, given an n-vertex graph together with its tree decomposition of width at most k, 
solve: 


e Opp CYCLE TRANSVERSAL in time 3% -kO . n, 
e MaxCur in time 2% - kO® . n, 
e g-Cotorine in time q% - kO) - n. 


7.19. Construct the remaining algorithms listed in Theorem That is, show that the 
following problems can be solved in time kO\*) . n on an n-vertex graph given together 
with its tree decomposition of width at most k: 
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FEEDBACK VERTEX SET, 

HAMILTONIAN PATH and LONGEST PATH, 
HAMILTONIAN CYCLE and LONGEST CYCLE, 
CHROMATIC NUMBER, 

CYCLE PACKING, 

CONNECTED VERTEX COVER, 

CONNECTED DOMINATING SET, 
CONNECTED FEEDBACK VERTEX SET. 


7.20. List COLORING is a generalization of VERTEX COLORING: given a graph G, a set of 
colors C, and a list function L : V(G) — 2° (that is, a subset of colors L(v) for each vertex 
v), the task is to assign a color c(v) € L(v) to each vertex v € V(G) such that adjacent 
vertices receive different colors. Show that on an n-vertex graph G, List COLORING can 
be solved in time n©(tw(@)), 


7.21 (£2). Consider the following problem: given a graph G, find a minimum set of vertices 
X C V(G) such that G — X does not contain a cycle on four vertices as a subgraph. Show 


how to solve this problem in time 20(#”)n°(), where k is the treewidth of G. 


7.22 (2). Consider the following problem: given a graph G, find an induced subgraph H 
of G of maximum possible number of vertices, such that the size of the largest independent 
set in H is strictly smaller than the size of the largest independent set in G. Show how to 


solve this problem in time 22°" n©(), where k is the treewidth of G. 


7.23. In the following problems, assume that the input graph G is given together with its 
tree decomposition of width at most k. 


e Inthe INDUCED MATCHING problem, given a graph G and an integer £, we ask if there 
is a subset of 2 vertices in G inducing a matching. Show that INDUCED MATCHING can 
be solved in time 200) nO, 

e () Fora fixed integer r > 1, the r-CENTER problem, given a graph G and an integer 
£, asks to find £ vertices such that every other vertex of G is at distance at most r from 
some of these vertices. Show that r-CENTER can be solved in time (r + 1)O)nOQ), 

e (È) SCATTERED SET, given a graph G and integers £ and d, asks for at least £ vertices 


that are at pairwise distance at least d. Show that the problem can be solved in time 
dk) nO), 


7.24. Obtain an algorithm for VERTEX Cover running in time 1.3803*kO@) + O(m/n) 
by combining branching on degree 4 vertices, the 2k vertex kernel of Theorem [2.21] and 
the fact that a graph on n vertices of maximum degree 3 has pathwidth at most % +0(n), 
and a path decomposition of such width can be constructed in polynomial time (see [195}). 


7.25. Show that for a fixed graph H, the property “graph G does not contain H as a 
minor” is expressible in MSO%2. 


7.26. Using the cops and robber game, give an algorithm deciding in time n©(*) if the 
treewidth of a graph G is at most k. 


7.27 (Æ). Show that every interval graph has no induced cycles of length more than 3. 


7.28. This exercise consists of the crucial steps used to prove Theorem 


1. For a pair of vertices u, v from the same connected component of graph G, a vertex set 
S is a (u,v)-separator if u,v ¢ S and u and v are in different connected components of 
G — S. A (u,v)-separator is minimal, if it does not contain any other (u, v)-separator 
as a proper subset. Finally, a set S is a minimal separator if S is a minimal (u, v)- 
separator for some u,v € V(G). Let us remark that a minimal separator S can properly 
contain another minimal separator S$’; this can happen if S’ separates other pair of 
vertices than S. A connected component C of G — S is full if S = N(C). Show that 
every minimal separator S has at least two full components. 
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2. Show that every minimal separator of a chordal graph is a clique. 

3. (Dirac’s Lemma) A vertex v is simplicial if its closed neighborhood N[v] is a clique. 
Show that every chordal graph G on at least two vertices has at least two simplicial 
vertices. Moreover, if G is not complete, then it has at least two nonadjacent simplicial 
vertices. 

4. Prove that every chordal graph G admits a tree decomposition such that every bag of 
the decomposition is a maximal clique of G. 

5. Prove Theorem|7.16 


7.29. We define a k-tree inductively. A clique on k +1 vertices is a k-tree. A new k-tree G 
can be obtained from a smaller k-tree G’ by adding a new vertex and making it adjacent 
to k vertices of G’ that form a clique in G’. Show that every k-tree is a chordal graph of 
treewidth k. Prove that for every graph G and integer k, G is a subgraph of a k-tree if and 
only if tw(G) < k. 


7.30. Show that every maximal treewidth k-graph G, i.e., a graph such that adding any 
missing edge to G increases its treewidth, is a k-tree. 


7.31. We say that a graph G is an intersection graph of subtrees of a tree if there exists 
a tree H and a collection (Hv),cv(q) of subtrees of H such that uv € E(G) if and only if 
V(A.)NV(A,) 4 0. Show that a graph is chordal if and only if it is an intersection graph 
of subtrees of a tree. 


7.32. Let G be an n-vertex graph, and let o = (v1, v2,..., Un) be an ordering of its vertices. 
We define the width of o, denoted by tos, as follows: 


te =. max |O({v1,v2,..., vu; })|. 
i=0,1,...,n 
The vertex separation number of a graph G, denoted by vsn(G), is equal to the minimum 
possible width of an ordering of V(G). Prove that for every graph G, its vertex separation 
number is equal to its pathwidth. 


7.33 (Æ). It seems that in the algorithm of Theorem|7.18]we could directly use $-balanced 
separators given by ean OL sass of 2-balanced separations given by Lemma|7.20 
and in this manner we could reduce the approximation factor from 4 to 3. Explain what 
is the problem with this approach. 


7.34. An n-vertex graph G is called an a-edge-expander if for every set X C V(G) of size 
at most n/2 there are at least a|X| edges of G that have exactly one endpoint in X. Show 
that the treewidth of an n-vertex d-regular a-edge-expander is Q(na/d) (in particular, 
linear in n if a and d are constants). 


7.35. Show that the dependency on k in the Excluded Grid Theorem needs to be 2(k?). 
That is, show a graph of treewidth (k?) that does not contain a k x k grid as a minor. 


7.36 (2). Let H be a planar graph. Show that there is a constant cy such that the 
treewidth of every H-minor-free graph is at most cp. Show that planarity requirement 
in the statement of the exercise is crucial, i.e., that Ks5-minor-free graphs can be of any 
treewidth. 

















7.37 (È). Prove that for every t > 1, Hy contains a bramble of order t + 1 and thus 
tw( t) =t. 























7.38. Prove the following version of the classic result of Lipton and Tarjan on separators 
in planar graphs. For any planar n-vertex graph G and W C V(G), there is a set S C V(G) 
of size at most Syn + 1 such that every connected component of G — S contains at most 


wi vertices of W. 
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7.39 (2). Using Corollary prove that on an n-vertex planar graph G problems listed 
in Theorem|7.9|can be solved in time 20(V™, while problems listed in Theorem can 
be solved in time 20(V™ logn), 


7.40. Show that the following problems are bidimensional: FEEDBACK VERTEX SET, IN- 
DUCED MATCHING, CYCLE PACKING, SCATTERED SET for a fixed value of d, LONGEST 
PATH, DOMINATING SET, and r-CENTER for a fixed r. 


7.41. Show that SCATTERED SET is FPT on planar graphs, when parameterized by k + d. 
For a constant d, give an algorithm with running time 20(V*) nO), 

7.42. Define a vertex-subset maximization problem @ with the following property: for a 
graph G with the maximum size of an independent set a(G) and the number of isolated ver- 
tices (G), it holds that OPTg(G) = max{0, a(G) — 21(G)}. Prove that Q is bidimensional 
and construct a family of planar graphs Gk such that for every k > 1, OPTo (Gk) = 0 and 
tw(Gk) > k. Infer that in Lemma|[7.28] the connectivity requirement is necessary. 


7.43. Prove Lemma [7.43 


7.44 (È). The input of the PARTIAL VERTEX Cover problem is a graph G with two 
integers k and s, and the task is to check if G contains a set of at most k vertices that 
cover at least s edges. Show that on planar graphs PARTIAL VERTEX CovER is FPT 
when parameterized by k. Give a subexponential 20(V®) nO @)-time algorithm for PARTIAL 
VERTEX COVER on planar graphs. 


7.45 (2). In the TREE SPANNER problem, given a connected graph G and an integer k, 
the task is to decide if G contains a spanning tree H such that for every pair of vertices 
u,v of G, the distance between u and v in H is at most k times the distance between u 
and v in G. Show that on planar graphs TREE SPANNER is FPT when parameterized by k. 


7.46. Prove that the bounds of Corollary and Theorem cannot be strengthened 
to hold for pathwidth instead of treewidth. In other words, find an example of a planar 
graph of constant diameter that has unbounded pathwidth. 


7.47. Prove Lemma|7.36 


7.48 ($). Show that SUBGRAPH Isomorpuism can be solved in time f(|V(H)|)|V(G)|C Cw) 
for some function f. 


7.49. Prove Lemma [7.37 

7.50 (2). Prove Lemma|7.39 

7.51. Prove Lemma [7.40 

7.52 (Æ). Prove that cographs have rankwidth at most 1. 

7.53. Prove that for any graph G it holds that rw(G) < tw(G) +1. 


7.54. A rooted forest is a union of pairwise disjoint rooted trees. A depth of a rooted forest 
is the maximum number of vertices in any leaf-to-root path. An embedding of a graph G 
into a rooted forest H is an injective function f : V(G) —> V(#) such that for every edge 
uv E€ E(G), f(u) is a descendant of f(v) or f(v) is a descendant of f(u). The treedepth of 
a graph G is equal to the minimum integer d, for which there exists a rooted forest H of 
depth d and an embedding of G into H. 

Show that the pathwidth of a nonempty graph is always smaller than its treedepth. 
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7.55 (%). Let G be a graph and consider the following cut function uq(X) defined for 
X C V(G). We look at the bipartite graph Gx = (V(G), E(X, V(G) \ X)) induced in G 
by the edges between X and V(G) \ X. Then ua(X) is equal to the size of the maximum 
matching in this graph. The MM-width (mazimum-matching-width) of a graph G, denoted 
by mmw(G), is equal to the wg-width of V(G). 

Prove that there exist constants K1, K2 such that for every graph G it holds that 
Kı - tw(G) < mmw(G) < K2 - tw(G). 


Hints 


Let P = (X1, X2,...,Xr) be a path decomposition. First, add bags Xo = X-41 = 0 
at the beginning and at the end of the sequence. Second, for every 0 <i < r, insert a few 
bags between X; and X;+1: first forget, one by one, all elements of X; \ Xi41, and then 
introduce, one by one, all elements of X;+1 \ X;. Finally, collapse neighboring equal bags. 


Proceed in a way similar to Exercise[7.1] By introducing some intermediate introduce 
and forget bags, ensure that every node of the tree with at least two children has its bag 
equal to all bags of its children. Then, split such a node into a number of join nodes. 


Fix some integers k and n, where n is much larger than k. Consider a graph G 
consisting of: 


1. a clique K on k vertices; 
2. n vertices a1,a2,...,@n, where each a; is adjacent to all the vertices of K; 
3. n vertices b1,b2,...,bn, where each b; is adjacent only to vertex a;i. 


Argue that in any nice tree decomposition of G of width at most k, every vertex of K is 
introduced roughly n times, once for every pair aj, b;. 


Let v € C and assume v belongs to some bag of a connected component T, of T — t. 
By (T3), and since v ¢ X+, all bags that contain v are bags at some nodes of T,. Together 
with (T2), this implies that for every edge uv € E(G — X+), both u and v need to appear 
in bags of only one connected component of T — t. 


Let S = Xa N Xp. Pick any va € Va \ S and vp € Vp \ S. If va belong to some bag of 
Tp, then, by (T3), va € Xa and va E Xp, hence va € S, a contradiction. Thus, va appears 
only in some bags of Ta; similarly we infer that v, appears only in some bags of Tp. Hence, 
by (T2), vavo ¢ E(G). 


Let C be the vertex set of a clique in G. Use Lemma[7.3] for every edge st € E(T) of 
a tree decomposition (T, {Xt}rev(r)), C © Vs or C C V; (where Vs and V; are defined as 
in Lemma [7.3] and Exercise [7-5p. Orient the edge st towards an endpoint a € {s,t} such 
that C C Va. Argue that, if all edges of E(T) have been oriented in such a manner, then 
C is contained in Xq for any a € V(T) with outdegree zero, and that at least one such a 
exists. 


Let H be a minor of G, let (Vah)nev(m) be a minor model of H in G, and let 

, {Xt }rev(r)) be a tree decomposition of G. For every t € V(T), let X; = {h € V(H) : 
Van Xt A 0}. Argue that (T, {Xi }rev(r)) is a tree decomposition of H of width not larger 
than the width of (T, {Xt}tev(r)): 


The treewidth of Kn is n — 1, the lower bound is provided by Exercise [7.6] The 
treewidth of Ka,» is min(a, b), the lower bound is provided by Exercise|7.7|since Kynin(a,b)+1 
is a minor of Ky». The treewidth of a forest is 1 if it contains at least one edge. 
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Observe that it is easy to combine tree decompositions of biconnected components of 
a graph into one tree decomposition, since their bags can coincide only in single vertices. 


7.10] For the upper bound, consider the following procedure. In an n-vertex tree G, find 
a vertex v such that every connected components of G — v has at most n/2 vertices. 
Recursively compute path decompositions of the components of G — v, each of width at 
most [logn/2] = [logn] — 1. Arrange them one after another, adding v to every bag in 
the decomposition. 

For the lower bound, using graph searching argue that if a tree G has a vertex v such 
that at least three connected component of G — v have pathwidth at least k, then G has 
pathwidth at least k + 1. 


7.11| By Exercise and since tw(K4) = 3, we have that every graph containing K4 
as a minor has treewidth at least 3. The other direction is much more difficult: you need 
to argue, by some case analysis, that in a bramble of order 4 you can always find the 
desired minor. Alternatively, you may use the decomposition of graphs into 3-connected 
components of Tutte; see [110]. If you follow this direction, you will essentially need to 
prove the following statement: every 3-connected graph that has more than three vertices 
contains K4 as a minor. 


An isolated vertex, a tree and a cycle are outerplanar graphs, so the treewidth of 
an outerplanar graph may be equal to 0, 1, or 2. Argue that it is always at most 2, for 
example in the following way. 


1. Assume that all vertices of an outerplanar graph G lie on the infinite face. 

2. By Exercise|7.9} assume G is biconnected. 

3. Triangulate G, that is, add some edges to G, drawn inside finite faces of G, such that 
every finite face of G is a triangle. 

4. Let G* be the dual of G, and let f* € V(G*) be the infinite face. Observe that G* — f* 
is a tree. Argue that, if you assign to every f € V(G* — f*) a bag Xp consisting of the 
three vertices of G that lie on the face f, then you obtain a valid tree decomposition 
of G. 


Alternatively, one may prove that outerplanar graphs are closed under taking minors, and 
use Exercise|7.11|combined with an observation that K4 is not outerplanar. 


Since graphs of treewidth at most k are closed under taking subgraphs, it suffices 
to show that every graph of treewidth at most k contains a vertex of degree at most k. 
Consider a nice tree decomposition of a graph of treewidth at most k, and examine a vertex 
whose forget node is furthest from the root. 


Prove that a d-degenerate n-vertex graph has at most dn edges, and use Exercise|7.14 


Solution 1: Turn the given tree decomposition into a nice tree decomposition 
(T,{Xt}tevcr)) of G of width t. For every v € V(G), let t(v) be the uppermost bag 
in T that contains v. Observe that, for every uv € V(G), either u E€ X44) or v € Xiu) 
Hence, it suffices to remember Ay := Na (v) N Xi) for every v € V(G), and, for every 
query (u,v), check if u € Ay or v € Au. 

Solution 2: Prove that for every d-degenerate graph G one can in linear time compute an 
orientation of edges of G such that every vertex has outdegree at most d (while the indegree 
can be unbounded). Then for every vertex v € V(G) we can remember its outneighbors in 
this orientation. To check whether u and v are adjacent, it suffices to verify whether u is 
among the outneighbors of v (at most d checks) or v is among the outneighbors of u (again, 
at most d checks). This approach works for d-degenerate graphs, and by Exercise [7.14] we 
know that graphs of treewidth at most k are k-degenerate. 


7.19} For CHROMATIC NUMBER, argue that you need at most k + 1 colors. For other 
problems, proceed similarly as in the dynamic-programming algorithm for STEINER TREE, 
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given in Section In the cases of HAMILTONIAN PATH, LONGEST PATH, LONGEST 
CycLe, and CycLe PackIna, you will need matchings between vertices of the bag instead 
of general partitions of the bag. 


In the state of the dynamic program, keep for every pair u,v of vertices in a bag, 
whether there exists a common neighbor of u and v among the forgotten vertices that was 
not hit by the solution. It is quite nontrivial that this information is sufficient for ensuring 
that all Cas are hit, so be very careful when arguing that your algorithm is correct. 


Consider a node t of a tree decomposition (T,{Xz}zevcr)) of the input graph G. 

et Vi be the union of all bags of descendants of t in T. In the dynamic-programming 
algorithm, for every S C X; and every family A of subsets of Xz \ S, we would like to 
compute a minimum cardinality of a set S C V; such that S N X+ = S and, moreover, the 
following holds: for every independent set I in (G — S)[Vj] such that there does not exist 
an independent set I’ in G[Vi] with Vi O I =ViN I’ and |I| < |J’|, the set Vin I belongs 
to A. 


For InpuceD Marcuina, define the state by partitioning the bag at t into three 
sets: (a) vertices that have to be matched inside Gz; (b) vertices that will not be matched, 
and play no role; (c) vertices that are “reserved” for matching edges that are yet to be 
introduced, and hence we require them be nonadjacent to any vertex picked to be matched, 
or reserved. For later points, think of the vertices of type (c) as sort of a “prediction” the 
state of the dynamic program encapsulates prediction of what will be the behavior in the 
rest of the graph, and requires the partial solution to be prepared for this behavior. 

For r-CENTER, use the prediction method: for every vertex of the bag, encapsulate 
in the dynamic programming state (a) what the distance is to the nearest center from a 
partial solution inside G+ (this part is called history), and (b) what the distance will be 
to the nearest center in the whole graph, after the whole solution is uncovered (this part 
is called prediction). Remember to check in forget nodes that the prediction is consistent 
with the final history. 

For SCATTERED SET, use the same prediction approach as for r-CENTER. 


The idea is to encode the statement that G contains a minor model of H. We first 
quantify the existence of vertex sets Vp for h € V(H). Then, we need to check that (a) 
every Vp is connected; (b) the sets Vp are pairwise disjoint; (c) if hiha € E(H), then there 
exists vıv2 € E(G) with vı € Vp, and v2 E€ Vh. All three properties are easy to express 
in MSOz. 


Let us first introduce a classic approach to resolving problems about games on 
graphs. The idea is to represent all configurations of the game and their dependencies 
by an auxiliary game played on a directed graph called the arena. The node set V of 
the arena is partitioned into two sets of nodes Vi and V2. It also has a specified node 
v € V corresponding to the initial position of the game, and a subset of nodes F C V 
corresponding to the final positions. In the auxiliary game there are two players, Player 1 
and Player 2, who alternately move a token along arcs of the arena. The game starts by 
placing the token on v, and then the game alternates in rounds. At each round, if the 
token is on a vertex from V;, i = 1,2, then the i-th player slides the token from its current 
position along an arc to a new position, where the choice of the arc depends on the player. 
Player 1 wins if she manages to slide the token to a node belonging to F, and Player 2 
wins if she can avoid this indefinitely. 

Prove that there exists an algorithm with running time depending polynomially on the 
size of the arena, which determines which player wins the game. Then take the cops-and- 
robber game on the input n-vertex graph G, and encode it as an instance of the auxiliary 
game on an arena of size n©(). Positions of the arena should reflect possible configurations 
in the cops-and-robber game, i.e., locations of the cops and of the robber, whereas arcs 
should reflect their possible actions. 
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1. If Sis a minimal (u, v)-separator, then the connected components of G—S that contain 
u and v are full. 

2. Use the fact that every minimal separator S has at least two full components. 

3. Use induction on the number of vertices in G. If G has two nonadjacent vertices v1 
and v2, then a minimal (v1, v2)-separator S is a clique. Use the induction assumption 
to find a simplicial vertex in each of the full components. 

4. Let v be a simplicial vertex of G. Use induction on the number of vertices n and 
construct a tree decomposition of G from a tree decomposition of G — v. 

5. (i) > (ii). Let T = (7, {Xt}tevcr)) be a tree decomposition of width k. Let H be 
the supergraph of G obtained from G by transforming every bag of T into a clique. 
Then T = (T, {Xt }rev(r)) is also a tree decomposition of H of width k, and thus the 
maximum clique size of H does not exceed k+ 1. Show that H is chordal. (ii) => (dit). 
Here you need to construct a search strategy for the cops. This strategy imitates the 
strategy of two cops on a tree. (iii) = (iv). We have discussed in the chapter that a 
bramble of order more than k + 1 gives a strategy for the robber to avoid k + 1 cops. 
(iv) = (i). This is the (harder) half of Theorem|7.15] 


Proceed by induction, using Theorem 


In one direction, argue that if G is an intersection graph of subtrees of a tree, then 
every cycle in G of length at least 4 has a chord. In the other direction, use the fact that 
every chordal graph G has a tree decomposition (T, {Xt}zevcr)) where every bag induces 
a clique (Exercise [7.28). Define Hy =T[{t € V(T) : v € X+}] for every v € V(G). Argue 
that the tree T with subtrees (Hv) ,ev(q) witnesses that G is an intersection graph of 
subtrees of a tree. 


Prove two inequalities, that vsn(G) < pw(G) and that pw(G) < vsn(G). In both 
cases, you can make a direct construction that takes an ordering/path decomposition of 
optimum width, and builds the second object of the same width. However, it can also be 
convenient to use other characterizations of pathwidth, given in Theorem[/.14] For instance, 
you may give a strategy for searchers that exploits the existence of a path decomposition 
of small width. 

Let G be an n-vertex d-regular a-edge-expander and let X bea 3-balanced separator 
in G for a uniform weight function, given by Lemma[?.19] Let D1,...,D, be the connected 
components of G — X. Since |D;| < n/2 for every 1 <i < r, we have that at least a|D;| 
edges connect D; and X. However, since G is d-regular, at most d|X| edges leave X. 
Consequently, 


P 


X a|Dil < a XI, 





i=l 
a(n — |X|) < d| X|, 
n—> < |X; 
d+a 


By Lemma Naya — 1 is a lower bound for the treewidth of G. 


Consider a clique on k? — 1 vertices. 


Show that planar graph H is a minor of a |V(H)|C x |V(H)|°@) grid and then 
use the Excluded Grid Theorem. 


In Section we have already given a bramble of order t. Play with this example 
to squeeze one more element of the bramble. 
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Use Lemma|7.19]and Corollary [7.24 


For an FPT algorithm parameterized by k + d, you may use either bidimensionality, 
or a modification of the shifting technique where you remove d consecutive layers. For a 
subexponential algorithm for a constant d, use bidimensionality. 


To obtain any fixed-parameter tractable algorithm for parameterization by k, you 
can use the shifting technique. Getting a subexponential one is more tricky. 

The problem is not bidimensional, however, it can be reduced to a bidimensional prob- 
lem. We order the vertices V(G) = {v1,v2,...,Un} nonincreasingly according to their 
degrees. Show that, if (G,k,s) is a yes-instance, then there is a solution X such that for 
some r it holds that X C {v1,v2,...,ur}, and, moreover, X is a dominating set in the 
graph G[{v1, v2,...,vr}]. After guessing the number r, we can use bidimensionality. 


7.45| Prove that a sufficiently large grid (in terms of k) does not admit a k-spanner, and 
hence, intuitively, every yes-instance of the problem should have bounded treewidth. Also 
finding a dynamic programming on a tree decomposition is nontrivial. Here, you may find 
it useful to prove that it suffices to prove the spanner property only on single edges. A 
caveat: the problem is not contraction-closed; it is possible to contract edges and increase 
the parameter. While grid arguments will work at the end, we have to be careful here. 


Consider a tree obtained from Exercise equipped with a universal vertex. 


7.50] First prove (say, by making use of the Planar Excluded Grid Theorem) that for every 
planar graph G of treewidth t, the treewidth of its dual graph G* does not exceed ct for 
some constant c. In fact, it is possible to show that tw(G) < tw(G*) + 1 (see [B61]) but 
this is quite involved. Then deletion of edges in G corresponds to contraction of edges in 
G*, and we can use Lemma|7.36 


First, perform a color coding step: every vertex v € V(G) is colored with a color 
c(v) E€ V(#), and we look for a subgraph isomorphic to H, whose vertices have appropriate 
colors. Second, compute a tree decomposition (T, {X¢}revr)) of H of width tw(H). Third, 
perform the following dynamic programming: for every t € V(T) and mapping ¢: Xt > 
V(G) such that c(¢(h)) = h for every h € X+, compute whether there exists a (color- 
preserving) homomorphism ¢ from H[V;] to G that extends ¢. Here, color-preserving means 
that c(d(h)) = h for every h, and V; denotes the union of all bags of descendants of t in 
the tree T. 


Take a nice tree decomposition (T, {Xt}rev(r)) of G of optimum width, and try to 

urn it to a branch decomposition (T, n) of at most the same rankwidth. You will need to 
get rid of leaf nodes and introduce nodes, so just collapse them. For forget nodes do the 
following: whenever a vertex w is forgotten at node t, add a pendant leaf t’ to t and put 
n(w) = t’. After further small adjustments, you will obtain a valid branch decomposition. 
Prove that its rankwidth is at most the width of the original tree decomposition using the 
following observation: removal of each e € E(T’) splits the vertices of V(G) into sets Xe 
and Ye, such that Xe is the set of vertices forgotten in some subtree of the original tree 
decomposition. 


Intuitively, we sweep the forest H from left to right. 

Let f be an embedding of G into a rooted forest H of depth d. Order the trees of H 
arbitrarily, and order the leaves of every tree of H in a natural way, obtaining a total 
order on all the leaves of H. For every leaf h of H, let Y, be the set of all vertices on the 
path from h to the root of the tree containing h. Argue that sets f—'(Y,) form a path 
decomposition of G of width at most d — 1. 


For the left inequality Kı -tw(G) < mmw(G), without loss of generality assume G 
has no isolated vertices, and let (T, nņ) be a branch decomposition of V(G) that yields the 
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optimum jG-width. We will construct a tree decomposition (T,{Xt}revcr)) of G (i.e., 
using the same tree T) of width similar to the wg-width of (T, n). Pick any edge e € E(T), 
and let Ge be the bipartite graph Gx, where (Xe, Ye) is the partition of V(G) obtained 
by removing edge e from T. Let We C V(G) be defined as follows: v € We if and only if 
for every maximum matching M in Ge, v is an endpoint of some edge in M. Show that 
We is a vertex cover of Ge, and moreover its cardinality is at most twice the cardinality of 
the maximum matching in Ge. For every t € V(T), let X+ be the union of sets We, where 
e iterates over edges e incident to t (there is always one or three such edges). 

Now verify that (T, {Xt}:evr)) is a tree decomposition of G. Property (T1) is obvious. 
For property (T3), prove that the definition of the set We is monotone in the following 
sense: if v is a vertex of We that belongs to, say, the left side of Ge, then v stays in We 
even after moving any number of other vertices from the left side to the right side (this 
corresponds to changing the graph Ge when moving between two adjacent edges of T). 
For property (T2), focus on one edge uv € E(G) and look at the path in T between the 
leaves corresponding to the vertices u and v. Let e1,e2,...,er be the edges of this path, 
and assume they appear in this order. Argue, using (T3) and the fact that We is a vertex 
cover of Ge for each e € E(T), that there exists an index 1 < io < r such that u € We; 
for every i < ig and v E€ We, for every i > ig. Then u,v are both contained in the bag at 
the common endpoint of ej, and ej, +41. 

For the right inequality mmw(G) < Kə - tw(G), use the same construction as in Exer- 
cise [7.53 
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logic in computer science, and elements of model theory, we refer you to a wide range 
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provided in the survey of Hlinény, Oum, Seese, and Gottlob [264]. 


Part IT 
Advanced algorithmic techniques 





Chapter 8 
Finding cuts and separators 


The notion of important cuts and the related com- 
binatorial bounds give a useful tool for showing the 
fized-parameter tractability of problems where a graph 
has to be cut into certain parts. We discuss how this 
technique can be used to show that EDGE MuLtrTI- 
way CUT and DIRECTED FEEDBACK VERTEX SET are 
FPT. Random sampling of important separators is a 
recent extension of this method, making it applicable 
to a wider range of problems. We illustrate how this 
extension works on a clustering problem called (p, q)- 
PARTITION. 





Problems related to cutting a graph into parts satisfying certain properties 
or related to separating different parts of the graph from each other form a 
classical area of graph theory and combinatorial optimization, with strong 
motivation coming from applications. Many different versions of these prob- 
lems have been studied in the literature: one may remove sets of edges or 
vertices in a directed or undirected graph; the goal can be separating two 
or more terminals from each other, cutting the graph into a certain num- 
ber of parts, perhaps with constraints on the sizes of the parts, etc. Despite 
some notable exceptions (e.g., minimum s -— t cut, minimum multiway cut in 
planar graphs with fixed number of terminals), most of these problems are 
NP-hard. In this chapter, we investigate the fixed-parameter tractability of 
some of these problems parameterized by the size of the solution, that is, the 
size of the cut that we remove from the graph (one could parameterize these 
problems also by, for example, the number of terminals, while leaving the size 
of the solution unbounded, but such parameterizations are not the focus of 
this chapter). It turns out that small cuts have certain very interesting ex- 
tremal combinatorial aspects that can be exploited in FPT algorithms. The 
notion of important cuts formalizes this extremal property and gives a very 
convenient tool for the treatment of these problems. 
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There is another class of problems that are closely related to cut problems: 
transversal problems, where we have to select edges/vertices to hit certain 
objects in the graph. For example, the ODD CYCLE TRANSVERSAL problem 
asks for a set of k vertices hitting every odd cycle in the graph. As we have 
seen in Section [4.4] iterative compression can be used to transform ODD Cy- 
CLE TRANSVERSAL into a series of minimum cut problems. The DIRECTED 
FEEDBACK VERTEX SET problem asks for a set of k vertices hitting ev- 
ery directed cycle in a graph. The fixed-parameter tractability of DIRECTED 
FEEDBACK VERTEX SET was a longstanding open problem; its solution was 
eventually reached by a combination of iterative compression and solving a 
directed cut problem (essentially) using important cuts. There are other ex- 
amples where the study of a transversal problem reveals that it can be turned 
into an appropriate cut problem. 

There is a particular difficulty that we face in the presentation of the basic 
results for the cut problems. There are many variants: we can delete vertices 
or edges, the graph can be directed or undirected, we may add weights (e.g., 
to forbid certain edges/vertices to be in the solution). While the basic theory 
is very similar for these variants, they require different notation and slightly 
different proofs. In this chapter, rather than presenting the most general form 
of these results, we mostly focus on the undirected edge versions, as they are 
the most intuitive and notationally cleanest. Then we go to the directed edge 
versions to treat certain problems specific to directed graphs (e.g., DIRECTED 
FEEDBACK VERTEX SET) and finish the chapter by briefly commenting on 
the vertex variants of these results. 

We assume basic familiarity with the concepts of cuts, flows, and algo- 
rithms for finding minimum cuts. Nevertheless, Section [8.1] reviews some of 
the finer points of minimum cuts in the form we need later. Section intro- 
duces the notion of important cuts and presents the bound on their number 
and an algorithm to enumerate them. Then Section [8.3] uses important cuts 
to solve problems such as EDGE MULTIWAY CUT. 

Very recently, a new way of using important cuts was introduced in one of 
the first FPT algorithms for MULTICUT. The “random sampling of important 
separators” technique allows us to restrict our attention to solutions that have 
a certain structural property, which can make the search for the solution much 
easier or can reduce it to some other problem. After the initial application for 
multicut problems, several other results used this technique as an ingredient 
of the solution. Unfortunately, most of these results are either on directed 
graphs (where the application of the technique is more delicate) or use several 
other nontrivial steps, making their presentation beyond the scope of this 
textbook. Section [8.4] presents the technique on a clustering problem where 
the task is to partition the graphs into classes of size at most p such that there 
are at most q edges leaving each class. This example is somewhat atypical 
(as it is not directly a transversal problem), but we can use it to demonstrate 
the random sampling technique in a self-contained way. 
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Fig. 8.1: The set A({21,22,a,b,c,d,e,f}) = {em, fy2} is a minimum 
(X,Y)-cut (and hence minimal); the set A({a1,22,a,b}) = {ac, bc, bd} is 
a minimal (X,Y)-cut, but not minimum. The set A({11,22,a,c,d}) = 
{x2b, ab, bc, bd, ce, df} is an (X,Y)-cut, but not minimal 


Section generalizes the notion of important cuts to directed graphs 
and states the corresponding results without proofs. We point out that there 
are significant differences in the directed setting: the way important cuts 
were used to solve EDGE MULTIWAY CUT in undirected graphs does not 
generalize to directed graphs. Nevertheless, we can solve a certain directed cut 
problem called SKEW EDGE MULTICUT using important cuts. In Section [8.6] 
a combination of the algorithm for SKEW EDGE MULTICUT and iterative 
compression is used to show the fixed-parameter tractability of DIRECTED 
FEEDBACK VERTEX SET and DIRECTED FEEDBACK ARC SET. 

Section|8.7|discusses the version of the results for vertex-deletion problems. 
We define the notions required for handling vertex-deletion problems and 
state the most important results. We do not give any proofs in this section, 
as they are mostly very similar to their edge-removal counterparts. 


8.1 Minimum cuts 


An (X,Y)-cut is a set S of edges that separates X and Y from each other, 
that is, G\S has no X—Y path. We need to distinguish between two different 
notions of minimality. An (X,Y)-cut S is a minimum (X,Y )-cut if there is 
no (X,Y)-cut S with |S’| < |S|. An (X,Y)-cut is (inclusion-wise) minimal 
if there is no (X,Y)-cut S’ with S’ C S (see Fig. (8.1). Observe that every 
minimum cut is minimal, but not necessarily the other way around. We allow 
parallel edges in this section: this is essentially the same as having arbitrary 
integer weights on the edges. 

It will be convenient to look at minimal (X,Y )-cuts from a different per- 
spective, viewing them as edges on the boundary of a certain set of vertices. 
If G is an undirected graph and R C V(G) is a set of vertices, then we denote 
by Ag(R) the set of edges with exactly one endpoint in R, and we denote 
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dg(R) = |Ac(R)| (we omit the subscript G if it is clear from the context). 
Let S be a minimal (X,Y )-cut in G and let R be the set of vertices reachable 
from X in G \ S; clearly, we have X C R C V(G)\ Y. Then it is easy to see 
that S is precisely A(R). Indeed, every such edge has to be in S (otherwise a 
vertex of V(G) \ R would be reachable from X) and S cannot have an edge 
with both endpoints in R or both endpoints in V(G)\ R, as omitting any such 
edge would not change the fact that the set is an (X,Y)-cut, contradicting 
minimality. 


Proposition 8.1. If S is a minimal (X,Y )-cut in G, then S = Ag(R), where 
R is the set of vertices reachable from X in G \ S. 


Therefore, we may always characterize a minimal (X,Y)-cut S as A(R) for 
some set X C R C V(G)\Y. Let us also note that A(R) is an (X,Y)-cut 
for every such set R with X C R C V(G)\Y, but not necessarily a minimal 
(X, Y)-cut (see Fig. (8.1). 

The well-known maximum flow and minimum cut duality implies that 
the size of the minimum (X,Y)-cut is the same as the maximum number 
of pairwise edge-disjoint X — Y paths. Classical maximum flow algorithms 
can be used to find a minimum cut and a corresponding collection of edge- 
disjoint X — Y paths of the same size. We do not review the history of these 
algorithms and their running times here, as in our setting we usually want 
to find a cut of size at most k, where k is assumed to be a small constant. 
Therefore, the following fact is sufficient for our purposes: each round of the 
algorithm of Ford and Fulkerson takes linear time, and k rounds are sufficient 
to decide if there is an (X, Y)-cut of size at most k. We state this fact in the 
following form. 


Theorem 8.2. Given a graph G with n vertices and m edges, disjoint sets 
X,Y CV(G), and an integer k, there is an O(k(n+m))-time algorithm that 
either 


e correctly concludes that there is no (X,Y )-cut of size at most k, or 
o returns a minimum (X,Y)-cut A(R) and a collection of |A(R)| pairwise 
edge-disjoint X — Y paths. 


Submodular set functions play an essential role in many areas of combina- 
torial optimization, and they are especially important for problems involving 
cuts and connectivity. Let f: 2V(©) > R be a set function assigning a real 
number to each subset of vertices of a graph G. We say that f is submodular 
if it satisfies the following inequality for every A, B C V(G): 


f(A) + f(B) > f(AN B) + f(AU B). (8.1) 


We will use the well-known fact that the function dg(X) = |Ag(X)| is sub- 
modular. 


Theorem 8.3. The function dg is submodular for every undirected graph G. 
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Fig. 8.2: The different types of edges in the proof of Theorem [8.3] 


Proof. Let us classify each edge e according to the location of its endpoints 
(see Fig. and calculate its contribution to the two sides of (8.1): 


1. If both endpoints of e are in ANB, in A\ B, in B\ A, or in V(G)\ (AUB), 
then e contributes 0 to both sides. 

2. If one endpoint of e is in AN B, and the other is either in A \ B or in 
B\ A, then e contributes 1 to both sides. 

3. If one endpoint of e is in V(G) \ (AUB), and the other is either in A\ B 
or in B \ A, then e contributes 1 to both sides. 

4. If e is between AN B and V(G) \ (AU B), then e contributes 2 to both 
sides. 

5. If eis between A\ B and B\ A, then e contributes 2 to the left-hand side 
and 0 to the right-hand side. 


As the contribution of each edge e to the left-hand side is at least as much 
as its contribution to the right-hand side, inequality (8.1) follows. 














A reason why submodularity of dg is particularly relevant to cut problems 
is that if A(A) and A(B) are both (X,Y )-cuts, then A(AN B) and A(AUB) 
are both (X,Y)-cuts: indeed, A N B and AU B both contain X and are 
disjoint from Y. Therefore, we can interpret Theorem|8.3]as saying that if we 
have two (X,Y )-cuts A(A), A(B) of a certain size, then two new (X, Y)-cuts 
A(AN B), A(AU B) can be created and there is a bound on their total size. 

The minimum (X,Y)-cut is not necessarily unique; in fact, a graph can 
have a large number of minimum (X, Y )-cuts. Suppose, for example, that X = 
{x}, Y = {y}, and k paths connect x and y, each of length n. Then selecting 
one edge from each path gives a minimum (X,Y)-cut, hence there are n” 
different minimum (X, Y)-cuts. However, as we show below, there is a unique 
minimum (X,Y)-cut A(Rmin) that is closest to X and a unique minimum 
(X,Y)-cut A(Rmax) closest to Y, in the sense that the sets Rmin and Rmax 
are minimum and maximum possible, respectively (see Fig.[8.3[a)). The proof 
of this statement follows from an easy application of the submodularity of 
dg. 
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Rmin Rmax 





Fig. 8.3: (a) A graph G with three edge-disjoint (X, Y )-paths and the (X,Y )- 
cuts Rmin and Rmax of Theorem|8.4| (b) The corresponding residual directed 
graph D defined in the proof of Theorem |8.5 


Theorem 8.4. Let G be a graph and X,Y C V(G) two disjoint sets of ver- 
tices. There are two minimum (X,Y)-cuts A(Rmin) and A(Rmax) such that 
if A(R) is a minimum (X,Y )-cut, then Rmin C R C Rmax- 


Proof. Consider the collection R of every set R C V(G) for which A(R) is a 
minimum (X,Y )-cut. We show that there is a unique inclusion-wise minimal 
set Rmin and a unique inclusion-wise maximal set Rmax in R. Suppose for 
contradiction that A(R) and A(R2) are minimum cuts for two inclusion-wise 
minimal sets Ry # Ro of R. By (8.1), we have 


da(Rı) + de(R2) > da(Rı N R2) + da(Rı U Rə). 


If À is the minimum (X,Y )-cut size, then the left-hand side is exactly 2A, 
hence the right-hand side is at most 2A. Observe that A(Rı N R2) and A(Rı U 
R2) are both (X, Y)-cuts. Taking into account that A is the minimum (X,Y )- 
cut size, the right-hand side is also exactly 2A, with both terms being exactly 
à. That is, A(Rı N Rə) is a minimum (X,Y )-cut. Now Rı # Rə implies that 
Rı O Rə C Rı, Ro, contradicting the assumption that both Rı and Rə are 
inclusion-wise minimal in R. 
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The same argument gives a contradiction if Rı # Rə are inclusion-wise 
maximal sets of the collection: then we observe that A(R, U Rə) is also a 
minimum (X, Y)-cut. 














While the proof of Theorem [8.4]is not algorithmic, one can find, for example, 
Rmin by repeatedly adding vertices to Y as long as this does not increase 
the minimum cut size (Exercise (8.5). However, there is also a linear-time 
algorithm for finding these sets. 


Theorem 8.5. Let G be a graph with n vertices and m edges, and X,Y C 
V(G) be two disjoint sets of vertices. Let k be the size of the minimum (X,Y)- 
cut. The sets Rmin and Rmax of Theorem|8.4| can be found in time O(k(n + 


m)). 


Proof. Let us invoke the algorithm of Theorem |8.2|and let P,,..., Pk be the 
pairwise edge-disjoint X — Y paths returned by the algorithm. We build the 
residual directed graph D as follows. If edge xy of G is not used by any of 
the paths P;, then we introduce both (x,y) and (y, x) into D. If edge ry of 
G is used by some P; in such a way that x is closer to X on path P;, then we 
introduce the directed edgq!| (y, x) into D (see Fig. [8.3{b)). 


We show that Rmin is the set of vertices reachable from X in the residual 
graph D and Rmax is the set of vertices from which Y is not reachable 
in D. 


Let Ag(R) be a minimum (X,Y )-cut of Œ. As Ag(R) is an (X,Y )-cut of 
size k, each of the k paths P,, ..., Py uses exactly one edge of Ag(R). This 
means that after P; leaves R, it never returns to R. Therefore, if P; uses an 
edge ab € Ac(R) with a € Rand b Æ R, then a is closer to X on P;. This 
implies that (a,b) is not an edge of D. As this is true for every edge of the 
cut Ag(R), we get that V(G) \ Ris not reachable from X in D; in particular, 
Y is not reachable. 

Let Rmin be the set of vertices reachable from X in D. We have shown 
in the previous paragraph that Y is not reachable from X in D (that is, 
X C Rmin C V(G) \ Y), hence Ac(Rmin) is an (X,Y)-cut of G. If we can 
show that this cut is a minimum (X, Y)-cut, then we are done: we have shown 
that if Ag(R) is a minimum (X, Y)-cut, then V(G) \ R is not reachable from 
X in D, implying that V(G) \ R C V(G) \ Rmin and hence Rmin C R. 

Every path P; uses at least one edge of the (X,Y )-cut Ag(Rmin). More- 
over, P; cannot use more than one edge of the cut: if P; leaves Rmin and later 
returns to Rmin on an edge ab with a ¢ Rmin, b E Rmin and a closer to X 
on P;, then (b,a) is an edge of D and it follows that a is also reachable from 





1 The reader might find introducing the edge (x,y) instead of (y,a) more natural and 
indeed the rest of the proof would work just as well after appropriate changes. However, 
here we follow the standard definition of residual graphs used in network flow algorithms. 
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X in D, contradicting a Rmin. Therefore, AG(Rmin) can have at most k 
edges, implying that it is a minimum (X,Y)-cut. Therefore, Rmin satisfies 
the requirements. 

A symmetrical argument shows that the set Rmax containing all vertices 
from which Y is not reachable in D satisfies the requirements. 














8.2 Important cuts 


Most of the results of this chapter are based on the following definition. 


Definition 8.6 (Important cut). Let G be an undirected graph and let 
X,Y C V(G) be two disjoint sets of vertices. Let S C E(G) be an (X, Y)-cut 
and let R be the set of vertices reachable from X in G \ S. We say that 
S is an important (X,Y )-cut if it is inclusion-wise minimal and there is no 
(X,Y )-cut S with |S’| < |S| such that R C R’, where R’ is the set of vertices 
reachable from X in G\ S. 


Note that the definition is not symmetrical: an important (X,Y )-cut is 
not necessarily an important (Y, X)-cut. 


An intuitive interpretation of Definition [8.6]is that we want to minimize 
the size of the (X,Y)-cut and at the same time we want to maximize 
the set of vertices that remain reachable from X after removing the 
cut. The important (X,Y)-cuts are the (X,Y)-cuts that are “Pareto 
efficient” with respect to these two objectives: increasing the set of ver- 
tices reachable from X requires strictly increasing the size of the cut. 


Let us point out that we do not want the number of vertices reachable 
from X to be maximal, we just want that this set of vertices be inclusion- 
wise maximal (i.e., we have R C R’ and not |R| < |R’| in the definition). 

The following proposition formalizes an immediate consequence of the def- 
inition. This is the property of important (X,Y)-cuts that we use in the 
algorithms. 


Proposition 8.7. Let G be an undirected graph and X,Y C V(G) two dis- 
joint sets of vertices. Let S be an (X,Y)-cut and let R be the set of vertices 
reachable from X in G\ S. Then there is an important (X,Y )-cut S' = A(R’) 
(possibly, S! = S) such that |S"| < |S| and RC R. 


Proof. By definition, every (X,Y )-cut has a subset that is an inclusion-wise 
minimal (X,Y )-cut. Let S* C S be a minimal (X,Y )-cut and let R* D R be 
the set of vertices reachable from X in G \ S*. If S* is an important (X, Y)- 
cut, then we are done. Otherwise, there is an (X,Y )-cut 5” = A(R’) for some 
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Y 


Fig. 8.4: A graph where every minimal (X, Y)-cut is an important (X, Y )-cut 


R' > R* D Rand |S"| < |S*| < |S]. If S is an important (X,Y)-cut, then 
we are done. Otherwise, we can repeat the argument: each time we strictly 
increase the set of vertices reachable from X and the size of the cut does 
not increase. Eventually, the process has to stop and we obtain an important 
(X, Y)-cut satisfying the required properties. 














To check whether a given (X,Y)-cut A(R) is important, one needs to 
check whether there is another (X, Y)-cut of the same size “after” R, that is, 
whether there is an (RU v, Y)-cut of size not larger for some v € V(G) \ R. 
This can be done efficiently using Theorem |8.5 


Proposition 8.8. Given a graph G with n vertices and m edges, two disjoint 
sets X,Y CV(G), and an (X,Y)-cut A(R) of size k, it can be tested in time 
O(k(n+m)) whether A(R) is an important cut. 


Proof. Observe that A(R) is an important X — Y cut if and only if it is the 
unique minimum (R, Y) cut. Therefore, if we compute the minimum (R, Y)- 
cut A(Rmax) using the algorithm of Theorem|8.5} then A(R) is an important 
(X,Y)-cut if and only if R = Rmax- 














Theorem |8.4]shows that A(Rmax) is the unique minimum (X, Y)-cut that 
is an important (X,Y)-cut: we have Rmax D R for every other minimum 
(X,Y )-cut A(R). However, for sizes larger than the minimum cut size, there 
can be a large number of incomparable (X, Y )-cuts of the same size. Consider 
the example in Fig. Any (X,Y)-cut A(R) is an important (X,Y )-cut: 
the only way to extend the set R is to move some vertex v; E€ V(G) \ (RUY) 
into the set R, but then the cut size increases by 1. Therefore, the graph has 
exactly (2) important (X,Y )-cuts of size n + 2. 

The main result of the section is a bound on the number of important cuts 
of size at most k that depends only on k and is independent of the size of the 
graph. Moreover, we show that the important separators can be efficiently 
enumerated. We need first the following simple observations, whose proofs 
are given as Exercise 


Proposition 8.9. Let G be a graph, X,Y C V(G) be two disjoint sets of 
vertices, and S = A(R) be an important (X,Y )-cut. 
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1. For every e € S, the set S \ {e} is an important (X,Y)-cut in G \ e. 
2. If S is an (X', Y)-cut for some X’ D X, then S is an important (X', Y)- 
cut. 


The first key observation in bounding the number of important (X,Y )- 
cuts is that every important (X, Y)-cut is after the set Rmax of Theorem |8.4 


Lemma 8.10. If A(R) is an important (X,Y)-cut, then Rmax C R. 
Proof. Let us apply the submodular inequality (8.1) on Rmax and R: 
dg(Rmax) Is dg(R) 2 dg(Rmax N R) T dq (Rmax U R). 


Let À be the minimum (X,Y )-cut size. The first term da(Rmax) on the left- 
hand side is exactly A (as Ag (Rmax) is a minimum (X, Y)-cut). Furthermore, 
Ac(Rmax N R) is an (X,Y)-cut, hence we have that the first term on the 
right-hand side is at least A. It follows then that the second term of the 
left-hand side is at least the second term on the right-hand side, that is, 
dg(R) > de(R U Rmax). If Rmax Z R, then Rmax U R is a proper superset 
of R. However, then the (X,Y )-cut Ag(Rmax U R) of size at most dg(R) 
contradicts the assumption that Ag(R) is an important (X, Y )-cut. Thus we 
have proved that Rmax C R. 














We are now ready to present the bound on the number of important cuts. 


Theorem 8.11. Let X,Y C V(G) be two disjoint sets of vertices in graph G 
and let k > 0 be an integer. There are at most 4: important (X,Y )-cuts of 
size at most k. 


Proof. We prove that there are at most 2?*—* important (X,Y)-cuts of size 


at most k, where A is the size of the smallest (X, Y)-cut. Clearly, this implies 
the upper bound 4* claimed in the theorem. The statement is proved by 
induction on 2k — A. If À > k, then there is no (X,Y)-cut of size k, and 
therefore the statement holds if 2k — A < 0. Also, if A = 0 and k > 0, then 
there is a unique important (X,Y )-cut of size at most k: the empty set. 


The proof is by branching on an edge xy leaving Rmax: an important 
(X, Y)-cut either contains xy or not. In both cases, we can recurse on 
an instance where the measure 2k — A is strictly smaller. 


Let A(R) be an important (X,Y)-cut and let A(Rmax) be the minimum 
(X, Y)-cut defined by Theorem [8.4] By Lemma|8.10] we have Rmax C R. As 
we have assumed A > 0, there is at least one edge xy with z € Rmax C R 
and y ¢ Rmax. Then vertex y is either in R or not. If y ¢ R, then zy is 
an edge of the cut A(R) and then S \ {xy} is an important (X,Y )-cut in 
G = G \ zy of size at most k’ := k — 1 (Prop. [8-9{1)). Removing an edge 
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can decrease the size of the minimum (X,Y)-cut size by at most 1, hence 
the size \’ of the minimum (X,Y)-cut in G” is at least A — 1. Therefore, 
2k’ — X' < 2k — à and the induction hypothesis implies that there are at most 
22k'—A' < 92k-—1 important (X,Y)-cuts of size k’ in G”, and hence at most 
that many important (X,Y )-cuts of size k in G that contain the edge zy. 

Let us count now the important (X, Y)-cuts not containing the edge xy. As 
Rmax C R, the fact that vy is not in the cut implies that even Rmax U{y} C R 
is true. Let X’ = Rmax U {y}; it follows that A(R) is an (X’, Y)-cut and in 
fact an important (X’, Y)-cut by Prop.|8.9|/2). There is no (X’, Y)-cut A(R’) 
of size À: such a cut would be an (X,Y)-cut with Rmax C Rmax U fy} C R’, 
contradicting the definition of Rmax. Thus the minimum size \ of an (X’, Y)- 
cut is greater than A. It follows by the induction assumption that the number 
of important (X’, Y)-cuts of size at most k is at most 22%- < 22k—à-1 which 
is a bound on the number of important (X,Y)-cuts of size k in G that do 
not contain xy. 

Adding the bounds in the two cases, we get the required bound 22". 














The main use of the bound in Theorem is for algorithms based on 
branching: as the number of important (X, Y )-cuts is bounded by a function 
of k, we can afford to branch on selecting one of them (see for example 
Theorem|8.19}. The following refined bound is very helpful for a tighter bound 
on the size of the search tree for such algorithms. Its proof is essentially the 
same as the proof of Theorem|8.11|with an appropriate induction statement 


(see Exercise (8.9). 


Lemma 8.12. Let G be an undirected graph and let X,Y C V(G) be two 
disjoint sets of vertices. If S is the set of all important (X,Y)-cuts, then 


Vaes t Ps 


Observe that Lemma implies Theorem the contribution of each 
important (X,Y )-cut to the sum is at least 47%, hence the fact that the sum 
is at most 1 implies that there can be at most 4" such cuts. 

So far, Theorem|8.11]and Lemmal8.12]are purely combinatorial statements 
bounding the number of important cuts. However, an algorithm for enumer- 
ating all of the important cuts follows from the proof of Theorem [8.11] First, 
we can compute Rmax using the algorithm of Theorem [8.5] Pick an arbitrary 
edge xy E€ A(Rmax). Then we branch on whether edge xy is in the important 
cut or not, and recursively find all possible important cuts for both cases. The 
search tree of the algorithm has size at most 4" and the work to be done in 
each node is O(k(n+m)). Therefore, the total running time of the branching 
algorithm is O(4* - k(n + m)) and it returns at most 4* cuts. 

However, there is an unfortunate technical detail with the algorithm de- 
scribed above. The algorithm enumerates a superset of all important cuts: 
by our analysis, every important cut is found, but there is no guarantee that 
all the constructed cuts are important (see Exercise [8.8). We present three 
different ways of handling this issue. The most obvious solution is to perform 
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a filtering phase where we use Proposition [8.8] to check for each returned cut 
whether it is important. This filtering phase takes time O(4* - k(n + m)). 


Theorem 8.13. Let X,Y C V(G) be two disjoint sets of vertices in a graph 
G with n vertices and m edges, and let k > 0 be an integer. The set of all 
important (X,Y )-cuts of size at most k can be enumerated in time O(4* - k - 
(n+m)). 


When using the refined bound of Lemma|8.12|to bound the running time of a 
branching algorithm, we need that the time spent in each node of the search 
tree be proportional to the number of directions into which we branch. This is 
unfortunately not true if we use Theorem|8.13}to enumerate all the important 
cuts: the 4* factor appears in the time bound, even if the number of actual 
important cuts returned is much less than 4". To obtain an algorithm that 
does not suffer from this problem, let Sj, be the set of all (X,Y )-cuts found 
before the filtering. Observe that if the algorithm considers only branches 
where k > A (otherwise there is no (X,Y )-cut of size at most k), then every 
leaf node has À = 0 and hence produces a cut, which means that the recursion 
tree has |S; | leaves. Furthermore, the height of the recursion tree is at most 
k, thus the recursion tree has O(k|S;,|) nodes. The work to be done in each 
node is O(k(n+m)). Therefore, we obtain the following bound on the running 
time. 


Theorem 8.14. Let X,Y C V(G) be two disjoint sets of vertices in graph G 
with n vertices and m edges, let k > 0 be an integer. One can enumerate a 
superset Si, of every important (X,Y)-cut of size at most k in time O(|S;| - 
k?-(n+m)). Moreover, the set Si, satisfies doses! AWS) < 1. 


Theorem gives a more refined bound on the running time than Theo- 
rem It may appear somewhat confusing, however, that the algorithm 
of Theorem [8.14] returns cuts that are not (X,Y)-cuts. What Theorem 
really says is that we have good bounds both on the running time and on the 
number of cuts returned even if we consider these cuts and do not restrict 
the output to the important (X, Y)-cuts. One may prefer a cleaner statement 
about the existence of an algorithm that returns only important (X, Y)-cuts 
and the running time depends linearly on the size of the output. It is possi- 
ble to obtain such an algorithm, but it requires an additional combinatorial 
argument. We need to ensure that the size of the recursion tree is O(k|S;]), 
where Spx is the set of all important (X,Y )-cuts of size at most k. We achieve 
this by checking before each recursion step whether at least one important 
(X,Y )-cut of size at most k will be returned; if not, then there is no need 
to perform the recursion. This way, we ensure that each leaf of the recursion 
tree returns an important (X,Y )-cut, hence the size of the recursion tree can 
be bounded by O(k|S;,|). The check before the recursion is made possible by 
the following lemma. 


Lemma 8.15. Let X,Y C V(G) be two disjoint sets of vertices in a graph 
G with n vertices and m edges, let k > 0 be an integer, and let Z C A(X) 
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be a set of edges. It can be checked in time O(k(n + m)) whether there is an 
important (X,Y)-cut S of size at most k with Z CS. 


Proof. Let G’ = G\ Z, let C = Ag (Rix) be the minimum (X, Y)-cut of G” 
given by Theorem [8.4] We claim that G has an important (X, Y)-cut of size 
at most k containing Z if and only Z UC is such a cut. Suppose that there is 
a D C E(G’) such that ZU D is an important (X,Y )-cut of size at most k. 
As ZUD is a minimal (X,Y)-cut, we can write it as Ag(Rp) for some set 
Rp 2 X. Each edge of Z C Ag(X) has an endpoint not in X; let P contain 
all these endpoints. Observe that Z C Ag(Rp) implies that Rp is disjoint 
from P. 

Clearly, Z UC is an (X,Y)-cut in G, thus if it is not important, then 
Proposition [8.7] implies that there is an important (X,Y)-cut S = A(Rg) 
with [S| < |ZU C| and Rl aax C Rg. If Z C S, then S\Z = Aga (Rs) is an 
(X,Y)-cut in G’ with |S \ Z| < |C|. By the definition of Rlaax, this is only 
possible if |S \ Z| = |C| and Rs = Ri,,,, which would imply S = Z U C. 
Therefore, at least one edge of Z is not in S, which implies that Rs contains 
at least one vertex of P. 


Consider now the submodular inequality on Rs and Rp. 
da(Rs) + de(Rp) > de(Rs N Rp) + de(Rs U Rp) 


Observe that Ag(RsURp) is an (X, Y)-cut and RgURp is a proper superset 
of Rp (as Rp is disjoint from P and Rg intersects P). Thus the fact that ZUD 
is an important (X,Y)-cut implies da(Rs U Rp) > d¢(Rp). It follows that 
dg(RsN Rp) < dg(Rs) <|ZUC| has to hold. Note that Z C Ag(Rs N Rp) 
as X C Rg Rp and Rp (and hence Rg N Rp) is disjoint from P. Therefore, 
Aa(RsN Rp) can be written as ZUC* for some C* C E(G') disjoint from Z. 
Now |ZUC*| = da(Rs O Rp) < |ZU C| implies |C*| < |C]. This contradicts 
the assumption that C is a minimum (X,Y )-cut in G”. 














Equipped with Lemma one can modify the branching algorithm in a 
way that it enumerates only important (X,Y )-cuts. (Exercise asks for 
working out the details of this algorithm.) 


Theorem 8.16. Let X,Y C V(G) be two disjoint sets of vertices in a graph 
G with n vertices and m edges, and let k > 0 be an integer. The set Sk 
of all important (X,Y )-cuts of size at most k can be enumerated in time 


O(|Sk| +k? - (n +m). 


The reader might wonder how tight the bound 4* in Theorem is. The 
example in Fig. [8.4] already showed that the number of important cuts of size 
at most k can be exponential in k. We can show that the bound 4? is tight 
up to polynomial factors; in particular, the base of the exponent has to be 4. 
Consider a rooted complete binary tree T with n > k levels; let X contain 
only the root and Y contain all the leaves (see Fig. |8.5). Then every rooted 
full binary subtree T’ with k leaves gives rise to an important (X,Y )-cut of 
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Fig. 8.5: A graph with O(4* /k/?) important (X,Y )-cuts of size k: every full 
binary subtree with k leaves gives rise to an important (X,Y )-cut of size k. 
The figure shows a subtree with six leaves and the corresponding (X, Y )-cut 
of size 6 (red edges) 


size k. Indeed, the cut A(R) obtained by removing the edges incident to the 
leaves of T” is an important (X, Y)-cut, as moving more vertices to R would 
clearly increase the cut size. It is well known that the number of full subtrees 
with exactly k leaves of a rooted complete binary tree is precisely the Catalan 
number Cp = (1/k)(7477) = @(4*/k3/2). 

As an application of the bound on the number of important cuts, we 
can prove the following surprisingly simple, but still nontrivial combinatorial 
result: only a bounded number of edges incident to Y are relevant to (X,Y )- 
cuts of size at most k. 


Lemma 8.17. Let G be an undirected graph and let X,Y C V(G) be two 
disjoint sets of vertices. The union of all minimal (X,Y )-cuts of size at most 
k contains at most k - 4" edges incident to Y. 


Proof. Let F contain an edge e if it is incident to a vertex of Y and it 
appears in an (X,Y )-important cut of size at most k. By Theorem [8.11] we 
have |F| < 4* -k. To finish the proof of the lemma, we show that if an edge 
e incident to Y appears in some minimal (X,Y)-cut A(R) of size at most 
k, then e E€ F. Proposition [8.7]implies that there is an important (X,Y )-cut 
A(R’) of size at most k with R C R’. Edge e has an endpoint x € R and an 
endpoint y ¢ R. As RNY = f, the endpoint y of e has to be in Y. Now we 
have x € RC R' and y € Y CV(G)\R’, hence e is an edge of A(R’) as well, 
implying e € F. 
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8.3 EDGE MULTIWAY CUT 


Let G be a graph and T C V(G) be a set of terminals. An edge multiway 
cut is a set S of edges such that every component of G \ S contains at most 
one vertex of T. Given a graph G, terminals T, and an integer k, the EDGE 
MULTIWAY CUT problem asks if a multiway cut of size at most k exists. If 
|T| = 2, that is, T = {t1, t2}, then EDGE Mutttway Cut is the problem of 
finding a (tı, t2)-cut of size at most k, hence it is polynomial-time solvable. 
However, the problem becomes NP-hard already for |T| = 3 terminals [124]. 

In this section, we show that EDGE MULTIWAY CuT is FPT parameterized 
by k. The following observation connects EDGE MULTIWAY CUT and the 
concept of important cuts: 


Lemma 8.18 (Pushing lemma for EDGE MULTIWAy CUT). Let t € T be 
an arbitrary terminal in an undirected graph G. If G has a multiway cut S, 
then it also has a multiway cut S* with |S*| < |S| such that S* contains an 
important (t, T \ t)-cut. 


Proof. If t is separated from T \ t in G, then the statement trivially holds, as 
the empty set is an important (t, T\t)-cut. Otherwise, let R be the component 
of G \ S containing t. As S is a multiway cut, R is disjoint from T \ ¢ and 
hence Sr = A(R) is a (t, T \ t)-cut contained in S. By Proposition [8.7] there 
is an important (t,T \ t)-cut S’ = A(R’) with R C R’ and |S"| < |Sp| (see 
Fig. (8.6). We claim that S* = (S \ Sr) US’, which has size at most |S], is 
also a multiway cut, proving the statement of the lemma. 


Replacing the (t, T\t)-cut Sp in the solution with an important (t, T\t)- 
cut S cannot break the solution: as it is closer to T \ t, it can be even 
more helpful in separating the terminals in T \ ¢ from each other. 


Clearly, there is no path between t and any other terminal of T \ ¢ in 
G \ S*, as S’ C S* is a (t, T \ t)-cut. Suppose therefore that there is a path 
P between two distinct terminals t),t2 € T \ t in G\ S*. If P goes through 
a vertex of R C R’, then it goes through at least one edge of the (t,T \ t) 
cut S’ = A(R’), which is a subset of S*, a contradiction. Therefore, we may 
assume that P is disjoint from R. Then P does not go through any edge of 
Sr = A(R) and therefore the fact that P is disjoint from S* implies that it 
is disjoint from S as well, contradicting the assumption that S is a multiway 
cut. 














Using this observation, we can solve the problem by branching on the 
choice of an important cut and including it into the solution: 


Theorem 8.19. EDGE MULTIWAY CUT on a graph with n vertices and m 
edges can be solved in time O(4* - kë - (n + m)). 
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Fig. 8.6: Replacing A(R) with A(R’) in the proof of Lemma does not 
break the solution: if a path connecting t),t2 € T \ t enters R, then it has to 
use an edge of A(R’) as well 


Proof. We solve the problem by a recursive branching algorithm. If all the 
terminals are separated from each other, then we are done. Otherwise, let 
t € T be a terminal not separated from the rest of the terminals. Let us use 
the algorithm of Theorem [8.16] to construct the set Sp consisting of every 
important (t, T \ t)-cut of size at most k. By Lemma [8-18] there is a solution 
that contains one of these cuts. Therefore, we branch on the choice of one of 
these cuts: for every important cut S” € Sk, we recursively solve the EDGE 
MULTIWAY CUT instance (G \ S$’, T, k— |9'|). If one of these branches returns 
a solution S, then clearly SUS’ is a multiway cut of size at most k in G. 
The correctness of the algorithm is clear from Lemma [8.18] We claim that 
the search tree explored by the algorithm has at most 4” leaves. We prove 
this by induction on k, thus let us assume that the statement is true for 
every value less than k. This means that we know that the recursive call 
(G\ S',T,k — |S'|) explores a search tree with at most 4*~!5'! leaves. Note 
that the value of k — |S'| is always nonnegative and the claim is trivial for 
k = 0. Using Lemma [8.12] we can bound the number of leaves of the search 


tree by 
5 4k-15'| < 4h. 5 47|8'| <a 
S’ES, S’ES;, 


Therefore, the total time spent at the leaves can be bounded by O(4*k(n + 
m)). As the height of the search tree is at most k, it has O(k4*) nodes. If 
we use the algorithm of Theorem|8.16]to enumerate Sp, then the time spent 
at an internal node with t children is O(tk?(n + m)). Summing the number 
of children for every internal node is exactly the number of edges, and hence 
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can be bounded by O(k4*). Therefore, the total work in the internal nodes 
is O(4*k3(n + m)). 


A slight generalization of EDGE MuULTIWAY CuT is EDGE MULTIWAY CUT 
FOR SETS, where the input is a graph G, pairwise disjoint sets Ti, ..., Tp, 
and an integer k; the task is to find a set S of edges that pairwise separates 
these sets from each other, that is, there is no T; — T; path in G \ S for any 
i Æ j. EDGE MULTIWAY CUT FOR SETS can be reduced to EDGE MULTIWAY 


CuT simply by consolidating each set into a single vertex. 














3 


Theorem 8.20. EDGE MULTIWAY CUT FOR SETS on a graph with n vertices 
and m edges can be solved in time O(4* - kè - (n + m)). 


Proof. We construct a graph G’ as follows. The vertex set of G’ is (V (G) \ 
U?_, T;) UT, where T = {v1,..., Up}. For every edge zy € E(G), there is a 
corresponding edge in G’: if endpoint x or y is in T;, then it is replaced by 
vi. It is easy to see that there is a one-to-one correspondence between the 
solutions of the EDGE MULTIWAY CUT FOR SETS instance (G, (Ti, ... , Tp), k) 
and the EDGE Murtiway Curt instance (G',T, k). 


3 














Another well-studied generalization of EDGE MULTIWAY CuT can be ob- 
tained if, instead of requiring that all the terminals be separated from each 
other, we require that a specified set of pairs of terminals be separated from 
each other. The input in the EDGE MULTICUT problem is a graph G, pairs 
(si1,t1), ---, (Se, te), and an integer k; the task is to find a set S of at most 
k edges such that G \ S has no s; — ti path for any 1 < i < 4. If we have 
a bounded number of pairs of terminals, EDGE MULTICUT can be reduced 
to EDGE MULTIWAY CUT FOR SETS: we can guess how the components of 
G \ S for the solution S' partition the 2¢ vertices s;, t; (1 < i < £) and solve 
the resulting EDGE MULTIWAY CUT FOR SETS instance. The 2¢ terminals 
can be partitioned in at most (2¢)?° different ways, thus we can reduce EDGE 
MULTICUT into at most that many instances of EDGE MULTIWAY CUT FOR 
SETS. This shows that EDGE MULTICUT is FPT with combined parameters k 
and /. We may also observe that the removal of at most k edges can partition 
a component of G into at most (k+1) components, hence we need to consider 
at most (k + 1)? partitions. 


Theorem 8.21. EDGE MULTICUT on a graph with n vertices and m edges 
can be solved in time O((24)” - 4* - k3(n + m)) or in time O((k + 1)” 4* . 
k3(n+m)). 


It is a more challenging question whether the problem is FPT parameter- 
ized by k (the size of the solution) only. The proof of this requires several non- 
trivial technical steps and is beyond the scope of this chapter. We remark that 
one of the papers proving the fixed-parameter tractability of EDGE MULTI- 
CUT was the paper introducing the random sampling of important separators 
technique : 


Theorem 8.22 ([59), ). EDGE MULTICUT on an n-vertex graph can be 
solved in time 208") . n20, 
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8.4 (p, q)-clustering 


The typical goal in clustering problems is to group a set of objects such 
that, roughly speaking, similar objects appear together in the same group. 
There are many different ways of defining the input and the objective of 
clustering problems: depending on how similarity of objects is interpreted, 
what constraints we have on the number and size of the clusters, and how 
the quality of the clustering is measured, we can define very different problems 
of this form. Often a clustering problem is defined in terms of a (weighted) 
graph: the objects are the vertices and the presence of a (weighted) edge 
indicates similarity of the two objects and the absence of an edge indicates 
that the two objects are not considered to be similar. 

In this section, we present an algorithm for a particular clustering prob- 
lem on graphs that is motivated by partitioning circuits into several field 
programmable gate arrays (FPGAs). We say that a set C C V(G) is a (p,q)- 
cluster if |C| < p and d(C) < q. A (p,q)-partition of G is a partition of 
V(G) into (p,q)-clusters. Given a graph G and integers p and q, the (p, q)- 
PARTITION problem asks if G has a (p,q)-partition. The main result of this 
section is showing that (p,q)-PARTITION is FPT parameterized by q. The 
proof is based on the technique of random sampling of important separators 
and serves as a self-contained demonstration of this technique. 


e An uncrossing argument shows that the trivial necessary condition 
that every vertex is in a (p, q)-cluster is also a sufficient condition. 

e Checking whether v is in a (p, q)-cluster is trivial in nO time, but 
an FPT algorithm parameterized by q is more challenging. 

e The crucial observation is that a minimal (p, q)-cluster containing v 
is surrounded by important cuts. 

e A randomized reduction allows us to reduce finding a (p, qg)-cluster 
containing v to a knapsack-like problem. 

e The derandomization of the reduction uses splitters, introduced in 
Section 


A necessary condition for the existence of (p, q)-partition is that for every 
vertex v € V(G) there exists a (p, q)-cluster that contains v. Very surprisingly, 
it turns out that this trivial necessary condition is actually sufficient for the 
existence of a (p,q)-partition. The proof (Lemma needs a variant of 
submodularity: we say that a set function f : 2V(®) > R is posimodular if it 
satisfies the following inequality for every A,B C V(G): 


f(A) + f(B) > F(A\ B) + f(B\ A). (8.2) 


This inequality is very similar to the submodular inequality (8.1) and the 
proof that dg has this property is a case analysis similar to the proof of 
Theorem |8.3] The only difference is that edges of type 4 contribute 2 to the 
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left-hand side and 0 to the right-hand side, while edges of type 5 contribute 
2 to both sides. 


Theorem 8.23. The function dg is posimodular for every undirected graph 
G. 


More generally, it can be shown that every symmetric submodular function 
(that is, when f(X) = f(V(G)\ X) for every X C V(G)) is posimodular (see 
Exercise [8.3). 

We are now ready to prove that every vertex being in a (p, q)-cluster is a 
sufficient condition for the existence of a (p, q)-partition. 


Lemma 8.24. Let G be an undirected graph and let p,q > 0 be two integers. 
If every v E€ V(G) is contained in some (p,q)-cluster, then G has a (p,q)- 
partition. Furthermore, given a set of (p,q)-clusters C1, ..., Cn whose union 
is V(G), a (p,q)-partition can be found in polynomial time. 


Proof. Let us consider a collection C1, ..., Cn of (p, q)-clusters whose union is 
V(G). If the sets are pairwise disjoint, then they form a partition of V (G) and 
we are done. If C; C C}, then the union remains V(G) even after throwing 
away Ci. Thus we can assume that no set is contained in another. 


The posimodularity of the function dg allows us to uncross two clusters 
Ci and C} if they intersect. 


Suppose that C; and C} intersect. Now either d(C;) > d(C;\C;) or d(C;) > 
d(C; \ C;) must be true: it is not possible that both d(C;) < d(C; \ C4) and 
d(C;) < d(Cj\C;) hold, as this would violate the posimodularity of d. Suppose 
that d(C;) > d(C; \ C;). Now the set C; \ C; is also a (p, q)-cluster: we have 
d(C; \ Ci) < d(C;) < q by assumption and |C} \ C;| < |Cj| < p. Thus we can 
replace C; by C} \ C; in the collection: it will remain true that the union of 
the clusters is V(G). Similarly, if d(C;) > d(C; \ C;), then we can replace C; 
by Ci \ Cj. 

Repeating these steps (throwing away subsets and resolving intersections), 
we eventually arrive at a pairwise-disjoint collection of (p,q)-clusters. Each 
step decreases the number of cluster pairs (C;,C;) that have nonempty in- 
tersection. Therefore, this process terminates after a polynomial number of 
steps. 














The proof of Lemma might suggest that we can obtain a partition 
by simply taking, for every vertex v, a (p, q)-cluster C, that is inclusion-wise 
minimal with respect to containing v. However, such clusters can still cross. 
For example, consider a graph on vertices a, b, c, d where every pair of vertices 
except a and d are adjacent. Suppose that p = 3, q = 2. Then {a,b,c} is a 
minimal cluster containing b (as more than two edges are going out of each 
of {b}, {b,c}, and {a,b}) and {b, c,d} is a minimal cluster containing c. Thus 


266 8 Finding cuts and separators 


unless we choose the minimal clusters more carefully in a coordinated way, 
they are not guaranteed to form a partition. In other words, there are two 
symmetric solutions ({a, b,c}, {d}) and ({a}, {b,c,d}) for the problem, and 
the clustering algorithm has to break this symmetry somehow. 

In light of Lemma [8.24] it is sufficient to find a (p, q)-cluster C, for each 
vertex v € V(G). If there is a vertex v for which there is no such cluster 
Cy, then obviously there is no (p, q)-partition; if we have such a C, for every 
vertex v, then Lemma [8.24] gives us a (p,q)-partition in polynomial time. 
Therefore, in the rest of the section, we are studying the (p,q)-CLUSTER 
problem, where, given a graph G, a vertex v € V(G), and integers p and q, 
it has to be decided if there is a (p, q)-cluster containing v. 

For fixed q, the (p,q)-CLUSTER problem can be solved by brute force: 
enumerate every set F of at most q edges and check if the component of 
G \ F containing v is a (p,q)-cluster. If C, is a (p,q)-cluster containing v, 
then we find it when F = A(C,,) is considered by the enumeration procedure. 


Theorem 8.25. (p,q)-CLUSTER can be solved in time nO) on an n-verter 
graph. 


The main result of the section is showing that it is possible to solve (p, q)- 
CLUSTER (and hence (p,q)-PARTITION) more efficiently: in fact, it is fixed- 
parameter tractable parameterized by q. By Lemma|8.24] all we need to show 
is that (p,q)-CLUSTER is fixed-parameter tractable parameterized by g. We 
introduce a somewhat technical variant of (p,q)-CLUSTER, the SATELLITE 
PROBLEM, which is polynomial-time solvable. Then we show how to solve 
(p, q)-CLUSTER using an algorithm for the SATELLITE PROBLEM. 

The input of the SATELLITE PROBLEM is a graph G, integers p, q, a vertex 
v € V(G), and a partition (Vo, Vi,..., Vp) of V(G) such that v € Vo and there 
is no edge between V; and Vj for any 1 <i < j < r. The task is to find a (p, q)- 
cluster C satisfying Vo C C such that for every 1 <i < r, either CNV; = Q 
or V; C C (see Fig. [8.7). 

Since the sets {V;} form a partition of V(G), we have r < n. For every 
V; (1 <i < r), we have to decide whether to include or exclude it from the 
solution cluster C. If we exclude V; from C, then d(C) increases by d(V;j), 
the number of edges between Vo and V;. If we include V; into C, then |C| 
increases by |C]. 


To solve SATELLITE PROBLEM, we need to solve the knapsack-like prob- 
lem of including sufficiently many V;’s such that d(C) < q, but not 
including too many to ensure |C| < p. 


Lemma 8.26. The SATELLITE PROBLEM can be solved in polynomial time. 


Proof. For a subset S of {1,...,r}, we define C(S) = Vo UUjeg Vi- Notice 
that d(C(S)) = d(Vo)— X ies d(Vi). Hence, we can reformulate the SATELLITE 
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Fig. 8.7: Instance of SATELLITE PROBLEM with a solution C. Excluding V2 
and V4 from C decreased the size of C by the grey area, but increased d(C) 
by the red edges 


PROBLEM as finding a subset S of {1,...,r} such that >7,.¢d(Vi) > d(Vo)—q 
and Jes |Vil < p — |Vo|. Thus, we can associate with every i an item with 
value d(V;) and weight |V;|. The objective is to find a set of items with total 
value at least Vtarget := d(Vo) — q and total weight at most Wmax := p — |Vo|. 
This problem is known as KNAPSACK and can be solved in polynomial time by 
a classical dynamic-programming algorithm in time polynomial in the number 
r of items and the maximum weight Wmax (assuming that the weights are 
positive integers). In our case, both r and Wmax are polynomial in the size 
of the input, hence a polynomial-time algorithm for SATELLITE PROBLEM 
follows. 

For completeness, we briefly sketch how the dynamic-programming algo- 
rithm works. For 0 < i < r and 0 < j < Wmax, we define T[i, j] to be the 
maximum value of a set S C {1,...,i} that has total weight at most j. By 
definition, T[0, 7] = 0 for every j. Assuming that we have computed T[i— 1, j] 
for every j, we can then compute T[i, j| for every j using the following re- 
currence relation: 


T[i, j] = max {T[i = 1, j], T[i = 1, j = lVil] + d(V;)}. 


That is, the optimal set S either does not include item i (in which case it is 
also an optimal set for T[i — 1, j]), or includes item 7 (in which case removing 
item 7 decreases the value by d(V;), decreases the weight bound by |V;|, and 
what remains should be optimal for T[i — 1, j —|V;|]). After computing every 
value T[i,j], we can check whether v is in a suitable cluster by checking 
whether Tr, Wmax] > Vtarget holds. 














What remains to be shown is how to reduce (p, g)-CLUSTER to the SATEL- 
LITE PROBLEM. We first present a randomized version of this reduction as it 
is cleaner and conceptually simpler. Then we discuss how splitters, introduced 
in Section [5.6] can be used to derandomize the reduction. 
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At this point it is highly unclear how to reduce (p,q)-CLUSTER to the 
SATELLITE PROBLEM. In the latter problem, the parts V; that may or 
may not be included into a cluster are very well behaved (disjoint and 
nonadjacent), whereas in (p,q)-CLUSTER we do not have any insight 
yet into the structure of connected components of G \ C, where C is the 
cluster we are looking for. 

First, the crucial observation is that we may assume that each con- 
nected component of G'\ C is surrounded by an important cut, leading 
to only an FPT number of candidates for such components. Second, we 
randomly filter out a fraction of the candidates, so that we can obtain 
a SATELLITE PROBLEM instance from the remaining ones. 


The following definition connects the notion of important cuts with our 
problem. 


Definition 8.27 (Important set). We say that a set X C V(G), v ¢ X is 
important if 


1. d(X) < q, 
2. G| X] is connected, 
3. there is no Y D X, v ¢ Y such that d(Y) < d(X) and G[Y] is connected. 


It is easy to see that X is an important set if and only if A(X) is an important 
(u, v)-cut of size at most q for every u € X. Thus we can use Theorem [8.11]to 
enumerate every important set, and Lemma |8.12]to give an upper bound on 
the number of important sets. The following lemma establishes the connection 
between important sets and finding (p, q)-clusters: we can assume that the 
components of G \ C for the solution C are important sets. 


Lemma 8.28. Let C be an inclusion-wise minimal (p,q)-cluster containing 
v. Then every component of G\ C is an important set. 


Proof. Let X be a component of G \ C. It is clear that X satisfies the first 
two properties of Definition (note that A(X) C A(C) implies d(X) < 
d(C) < q). Thus let us suppose that there is a Y D X, v ¢ Y such that 
d(Y) < d(X) and G[Y] is connected. Let C” := C \ Y. Note that C” is a 
proper subset of C: every neighbor of X is in C, thus a connected superset 
of X has to contain at least one vertex of C. It is easy to see that C” is a 
(p, q)-cluster: we have A(C’) C (A(C)\ A(X)) UA(Y) and therefore d(C’) < 
d(C) — d(X) + d(Y) < d(C) < q and |C"| < |C| < p. This contradicts the 
minimality of C. 














We are now ready to present the randomized version of the reduction. 


Lemma 8.29. Given an n-vertez graph G, vertex v € V(G), and integers p 
and q, we can construct in time 2° -n° an instance I of the SATELLITE 
PROBLEM such that 
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e If some (p,q)-cluster contains v, then I is a yes-instance with probability 
270), 


e If there is no (p,q)-cluster containing v, then I is a no-instance. 


Proof. We may assume that G is connected: if there is a (p,q)-cluster con- 
taining v, then there is such cluster contained in the connected component 
of v. 

For every u € V(G), u Æ v, let us use the algorithm of Theorem [8.11] to 
enumerate every important (u,v)-cut of size at most q. For every such cut 
S, let us put the component K of G \ S containing u into the collection X. 
Note that the same component K can be obtained for more than one vertex 
u, but we put only one copy into ¥. 

Let X’ be a subset of X, where each member K of Æ is chosen with 
probability 474(£) independently at random. Let Z be the union of the sets in 
x’, let Vi, ..., Vp be the connected components of G[Z], and let Vo = V (G) \ 
Z. It is clear that Vo, Vi, ..., Vp describe an instance I of the SATELLITE 
PROBLEM, and a solution for I gives a (p,q)-cluster containing v. Thus we 
only need to show that if there is a (p,q)-cluster C containing v, then J is a 
yes-instance with probability 270%. 


We show that the reduction works if the union Z of the selected im- 
portant sets satisfies two constraints: it has to cover every component 
of G \ C and it has to be disjoint from the vertices on the boundary 


of C. The probability of the event that these constraints are satisfied is 
9-O(4). 


Let C be an inclusion-wise minimal (p, q)-cluster containing v. We define 
O(C) to be the border of C: the set of vertices in C with a neighbor in V(G)\C, 
or in other words, the vertices of C incident to A(C). Let K1, ..., Ki be the 
components of G \ C. Note that every_edge of A(C) enters some K;, thus 
va d(K;) = d(C) < q. By Lemma every K; is an important set, and 
hence it is in X. Consider the following two events: 


(E1) Every component K; of G \ C is in X’ (and hence K; C Z). 
(E2) ZN A(C) = 9. 


The probability that (E1) holds is []{_, 4-#*0) = 47 Lind) > 4-4, 
Event (E2) holds if for every w € O(C), no set K € X with w € K is selected 
into Æ’. It follows directly from the definition of important cuts that for 
every K € X with w € K, the set A(K) is an important (w,v)-cut. Thus 
by Lemma X kex WER 4-l4)| < 1. To bound the probability of the 
event Z N 0 = ), we have to bound the probability that no important set 
intersecting O(C) is selected: 
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I] a-4)= J] (1 — 4-44) 


KEX wea(C) Ke% 
KNna(C)#0 wEK 
( —4-UK) 
2 ge —a(K ) 
wea(C) KEX ae) 
wek 
4 
> exp (-; ea) 
wed(C) KEX 
wEK 


> (e73) > e741/3, 

In the first inequality, we use the fact that every term is less than 1 and 
every term on the right-hand side appears at least once on the left-hand side. 
In the second inequality, we use the fact that 1+a > exp(x/(1+x)) for every 
x > —1. In the third inequality, we use the fact that d(K) > 1 follows from 
the assumption that the graph G is connected, hence 1—4~“(*) > 3/4 holds. 

Events (E1) and (E2) are independent: (E1) is a statement about the 
selection of a specific subcollection A C ¥ of at most q sets that are disjoint 
from (C), while (E2) is a statement about not selecting any member of a 
specific subcollection B C ¥ of at most |O(C)|- 4% sets intersecting S. Thus, 
with probability at least 2-°(@, both (E1) and (E2) hold. 

Suppose that both (E1) and (E2) hold. We show that the corresponding 
instance I of the SATELLITE PROBLEM is a yes-instance. In this case, every 
component K; of G\C is a component Vj of G[Z]: K; C Z by (E1) and every 
neighbor of K; is outside Z. Thus C is a solution of J, as it can be obtained 
as the union of Vo and some components of G[Z]. 














Lemma gives a randomized reduction from (p,q)-CLUSTER to the 
SATELLITE PROBLEM, which is polynomial-time solvable (Lemma [8.26}. 
Therefore, there is a randomized algorithm for (p, q)-CLUSTER with running 
time 20( . n°) and success probability pcorrecte = 27°. By repeating 
the algorithm [1/pcorrect| = 20 times, the probability of a false answer 
decreases to (1 — Deorrect)!1/Peoreet] < 1/e (using 1— x < e7”). 


Corollary 8.30. There is a2°\0-n° time randomized algorithm for (p, q)- 
CLUSTER with constant error probability. 


To derandomize the proof of Lemmal8.29]and to obtain a deterministic ver- 
sion of Corollary [8.30] we use splitters, which were introduced in Section [5.6] 
We present here only a simpler version of the derandomization, where the 
dependence on q in the running time is of the form 204°), Tt is possible to 
improve this dependence to 20, matching the bound in Corollary [8-30] 
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However, the details are somewhat technical and hence we omit here the 
description of this improvement. 

Recall that Theorem provided us with an (n,k,k?)-splitter of size 
O(k® log klogn): a family F of functions from [n] to [k?] such that for any 
subset X C [n] with |X| = k, one of the functions f in the family is injective 
on X. 


The main idea of the derandomization is to replace the random selection 
of the subfamily X’ by a deterministic selection based on the functions 
in a splitter. 


Theorem 8.31. (p,q)-CLUSTER can be solved in time 20°) . nO, 


Proof. In the algorithm of Lemma [8.29| a random subset of a universe X of 
size s = |X| < 49 -n is selected. If the (p, q)-CLUSTER problem has a solution 
C, then there is a collection A C ¥ of at most a := q sets and a collection 
BC X of at most b := q- 4% sets such that if every set in A is selected and no 
set in B is selected, then (E1) and (E2) hold. Instead of selecting a random 
subset, we try every function f in an (s,a + b, (a + b)?)-splitter F (obtained 
through Theorem 5.16}, and every subset F C [(a + b)?] of size a (there are 
(ee) = 20(9")) such sets F). For a particular choice of f and F, we select 
those sets S € X into X’ for which f(S) € F. The size of the splitter F 
ig 2°) . logn and the number of possibilities for F is 203°), Therefore, we 
construct 204°) : logn instances of the SATELLITE PROBLEM. 

By the definition of the splitter, there will be a function f that is injective 
on AUB, and there is a subset F such that f(S) € F for every set S in A and 
f(S) ¢ F for every set S in B. For such an f and F, the selection will ensure 
that (E1) and (E2) hold. This means that the constructed instance of the 
SATELLITE PROBLEM corresponding to f and F has a solution as well. Thus 
solving every constructed instance of the SATELLITE PROBLEM in polynomial 
time gives a 20(7) . n° algorithm for (p, q)-CLUSTER. 














One can obtain a more efficient derandomization by a slight change of the 
construction in the proof of Theorem and a more careful analysis. This 
gives a deterministic algorithm with 27‘ dependence on q. 


Theorem 8.32 ([324]). (p,q)-CLUSTER can be solved in time 2° . n°, 


Finally, we have shown in Lemma that there is a reduction from (p, q)- 
PARTITION to (p,q)-CLUSTER, hence fixed-parameter tractability follows for 
(p, q)-PARTITION as well. 


Theorem 8.33. (p,q)-PARTITION can be solved in time 20 . nO), 
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8.5 Directed graphs 


Problems on directed graphs are notoriously more difficult than problems 
on undirected graphs. This phenomenon has been observed equally often in 
the area of polynomial-time algorithms, approximability, and fixed-parameter 
tractability. Let us see if the techniques based on important cuts survive the 
generalization to directed graphs. 

Given a directed graph G and two disjoint sets of vertices X,Y C V(G), 
a directed (X,Y )-cut is a subset S of edges such that G \ S has no directed 
path from X to Y (but it may have directed paths from Y to X). We denote 
by AG(R) the set of edges starting in R and ending in V(G) \ R. As for 
undirected graphs, every minimal (X,Y )-cut S can be expressed as A(R) 
for some X C R C V(G) \ R. Important cuts can be defined analogously for 
directed graphs. 


Definition 8.34 (Important cut in directed graphs). Let G be a di- 
rected graph and let X,Y C V(G) be two disjoint sets of vertices. Let 
S C E(G) be an (X,Y)-cut, and let R be the set of vertices reachable from 
X in G \ S. We say that S is an important (X,Y)-cut if it is minimal and 
there is no (X,Y)-cut S’ with |S’| < |S| such that R C R’, where R’ is the 
set of vertices reachable from X in G \ S’. 


Proposition generalizes to directed graphs in a straightforward way. 


Proposition 8.35. Let G be a directed graph and X,Y C V(G) be two dis- 
joint sets of vertices. Let S be an (X,Y)-cut and let R be the set of ver- 
tices reachable from X in G\ S. Then there is an important (X,Y)-cut 
S’ = At(R’) (possibly, S' = S) such that |S’| < |S| and RC R. 


We state without proof that the bound of 4 of Theorem holds also 
for directed graphs. 


Theorem 8.36. Let X,Y C V(G) be two sets of vertices in a directed graph 
G with n vertices and m edges, let k > 0 be an integer, and let Sp be the set 
of all (X,Y )-important cuts of size at most k. Then |Sp| < 4* and Sp can be 
constructed in time O(|S,|-k?(n +m)). 


Also, an analogue of the bound of Lemma holds for directed important 
cuts. 


Lemma 8.37. Let G be a directed graph and let X,Y C V(G) be two dis- 
joint sets of vertices. If S is the set of all important (X,Y)-cuts, then 


D ! s 1. 


Given a directed graph G, a set T C V(G) of terminals, and an integer k, 
the DIRECTED EDGE MULTIWAY CUT problem asks for a set S of at most 
k edges such that G \ S has no directed tı — tz path for any two distinct 
t1,t2 € T. Theorem [8.36] gives us some hope that we would be able to use 
the techniques of Section [8.3] for undirected EDGE MULTIWAY CuT also for 
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Fig. 8.8: The unique important (t1, t2)-cut is the edge (b, t2), the unique im- 
portant (t2, tı)-cut is the edge (b, t1), but the unique minimum edge multiway 
cut is the edge (a, b) 


directed graphs. However, the pushing lemma (Lemma [8.18) is not true on 
directed graphs, even for |T| = 2; see Fig. [8.8] for a simple counterexample 
where the unique minimum multiway cut does not use any important cut be- 
tween the two terminals. The particular point where the proof of Lemma [8.18] 
breaks down in the directed setting is where a T \ t — t path appears af- 
ter replacing S with S*. This means that a straightforward generalization of 
Theorem [8.19]to directed graphs is not possible. Nevertheless, using different 
arguments (in particular, using the random sampling of important separators 
technique), it is possible to show that DIRECTED EDGE MULTIWAY CUT is 
FPT parameterized by the size k of the solution. 


Theorem 8.38 ([91, [89]). DIRECTED EpGrE MuLtiwAy CUT on an n- 
vertex graph can be solved in time 20°) . nO), 


The input of the DIRECTED EDGE MULTICUT problem is a directed graph 
G, a set of pairs (s1,t1), ..., (se, te), and an integer k, the task is to find a 
set S of at most k edges such that G \ S has no directed s; — t; path for any 
1 < i< k. This problem is more general than DIRECTED EDGE MULTIWAY 
CUT: the requirement that no terminal be reachable from any other terminal 
in G \ S can be expressed by a set of |T|((T| — 1) pairs (s;, ti). 

In contrast to the undirected version (Theorem [8-22}, DIRECTED EDGE 
MULTICUT is W[1|-hard parameterized by k, even on directed acyclic graphs. 
But the problem is interesting even for small values of €. The argument of 
Theorem (essentially, guessing how the components of G \ S partition 
the terminals) no longer works, as the relation of the terminals in G \ S can 
be more complicated in a directed graph than just a simple partition into 
connected components. The case @ = 2 can be reduced to DIRECTED EDGE 
MULTIWAY CUT in a simple way (see Exercise [8.14}, thus Theorem [5-38] 
implies that DIRECTED EDGE MULTICUT for @ = 2 is FPT parameterized by 
k. The case of a fixed £ > 3 and the case of jointly parameterizing by £ and k 
are open for general directed graphs. However, there is a positive answer for 
directed acyclic graphs. 


Theorem 8.39 ([309]). DIRECTED EDGE Muttiway CUT on directed acyclic 
graphs is FPT parameterized by £ and k. 
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There is a variant of directed multicut where the pushing argument does 
work. SKEW EDGE MULTICUT has the same input as DIRECTED EDGE MUL- 
TICUT, but now the task is to find a set S' of at most k edges such that G\ S 
has no directed s; — tj path for any i > j. While this problem is somewhat 
unnatural, it will be an important ingredient in the algorithm for DIRECTED 
FEEDBACK VERTEX SET in Section [8.6] 


Lemma 8.40 (Pushing lemma for Skew EDGE MutticutT). Let 
(G, ((51, t1),-.-, (se, t2)), k) be an instance of SKEW EDGE MULTICUT. If the 
instance has a solution S, then it has a solution S* with |S*| < |S| that 
contains an important (se, {t1,...,te})-cut. 


Proof. Let T = {t,,...,t¢} and let R be the set of vertices reachable from 
se in G \ S. As S is a solution, R is disjoint from T and hence At(R) is an 
(se, T)-cut. If Sr = A+ (R) is an important (se, T)-cut, then we are done: 3 
contains every edge of At(R). Otherwise, Proposition|8.35]implies that there 
is an important (X,Y)-cut S’ = At(R’) such that R C R’ and |S’| < |Spl. 
We claim that S* = (S'\SR)US", which has size at most |S], is also a solution, 
proving the statement of the lemma. 


The reason why replacing Spr with the important (se, T)-cut S’ does not 
break the solution is because every path that we need to cut in SKEW 
EDGE Mutricut ends in T = {t),..., te}. 


Clearly, there is no path between sg and any terminal in T in G \ S*, as 
S” C S* is an (sg, T)-cut. Suppose therefore that there is an s; + t; path P 
for some i > j in G \ S*. If P goes through a vertex of R C R’, then it goes 
through at least one edge of the (s¢, T)-cut S’ = At(R’), which is a subset 
of S*, a contradiction. Therefore, we may assume that P is disjoint from R. 
Then P does not go through any edge of Sr = A(R) and therefore the fact 
that it is disjoint from S* implies that it is disjoint from S, contradicting the 
assumption that S' is a solution. 


Equipped with Lemma and Lemma |8.40| a branching algorithm very 
similar to the proof of Theorem shows the fixed-parameter tractability 
of SKEW EDGE MULTICUT. 














Theorem 8.41. SKEW EDGE MULTICUT on a graph with n vertices and m 
edges can be solved in time O(4* - kè - (n + m)). 
8.6 DIRECTED FEEDBACK VERTEX SET 


For undirected graphs, the FEEDBACK VERTEX SET problem asks for a set 
of at most k vertices whose deletion makes the graph acyclic, that is, a forest. 
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We have already seen a branching algorithm for this problem in Section 
an algorithm based on iterative compression in Section and a simple 
randomized algorithm in Section [5.1] It is easy to see that the edge version 
of FEEDBACK VERTEX SET is polynomial-time solvable. 

For directed graphs, both the edge and the vertex versions are interesting. 
A feedback vertex set of a directed graph G is set S C V(G) of vertices such 
that G \ S has no directed cycle, while a feedback arc M is a set S C E(G) 
of edges such that G \ S has no directed cycle. Given a graph G and an 
integer k, DIRECTED FEEDBACK VERTEX SET asks for a feedback vertex set 
of size at most k. The DIRECTED FEEDBACK ARC SET is defined analogously; 
recall that in Section [2.2.2] we have designed a polynomial kernel for a special 
case of DIRECTED FEEDBACK ARC SET, namely FEEDBACK ARC SET IN 
TOURNAMENTS. 

There is a simple reduction from the vertex version to the edge version 
(Exercise asks for a reduction in the reverse direction). 


Proposition 8.42. DIRECTED FEEDBACK VERTEX SET can be reduced to 
DIRECTED FEEDBACK ARC SET in linear time without increasing the pa- 
rameter. 


Proof. Let (G,k) be an instance of DIRECTED FEEDBACK VERTEX SET. 
We construct a graph G’, where two vertices vin and Vout correspond to 
each vertex v € V(G). For every edge (a,b) € E(G), we introduce the edge 
(dout, bin) into G”. Additionally, for every v € V(G), we introduce the edge 
(Vin, Vout) into G”. Observe that there is a one-to-one correspondence between 
the directed cycles of length exactly £ in G and the directed cycles of length 
exactly 2¢ in G’. 

We claim that G has a feedback vertex set S of size at most k if and only 
if G’ has a feedback arc set S’ of size at most k. For the forward direction, 
given a feedback vertex set S C V(G) of G, let S’ contain the corresponding 
edges of G’, that is, S = {(vin, Vous) : v € S}. If there is a directed cycle 
C’ in G’ \ S’, then the corresponding cycle C in G contains at least one 
vertex v € S. But then the corresponding edge (vin, Vout) of C” is in S’, a 
contradiction. 

For the reverse direction, let S” C E(G) be a feedback arc set of G’. We may 
assume that every edge of S” is of the form (vin, Vout): if (Gout; bin) € S”, then 
we may replace it with (bin, bout), as every directed cycle going through the 
former edge goes through the latter edge as well. Let S = {vu : (vin, Vout) € 
S'}. Now, if G\ S has a directed cycle C, then the corresponding directed 
cycle C” of G goes through an edge (vin, Vout) € S”, implying that vertex v of 
C is in S, a contradiction. 

















2 Some authors prefer to use the term “arc” for the edges of directed graphs. While we 
are using the term “directed edge” in this book, the term “arc” is very commonly used in 
the context of the DIRECTED FEEDBACK ARC SET problem, hence we also use it in this 
section. 
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In the rest of the section, we show the fixed-parameter tractability of 
DIRECTED FEEDBACK ARC SET using the technique of iterative compression, 
introduced in Chapter [4] 


We apply a trick that is often useful when solving edge-deletion prob- 
lems using iterative compression: the initial solution that we need to 
compress is a vertex set, not an edge set. 


That is, the input of the DIRECTED FEEDBACK ARC SET COMPRESSION 
problem is a directed graph G, an integer k, and a feedback vertex set W of 
G; the task is to find a feedback arc set S of size at most k. We show, by a re- 
duction to SKEW EDGE MULTICUT, that this problem is FPT parameterized 
by |W| and k. 


Lemma 8.43. DIRECTED FEEDBACK ARC SET COMPRESSION can be solved 
in time O(|W|!- 4* - k(n + m)) on a graph with n vertices and m edges. 


Proof. The task is to find a set S of edges where G\ S has no directed cycle, or, 
equivalently, has a topological ordering. Every topological ordering induces 
an ordering on the set W of vertices. Our algorithm starts by guessing an 
ordering W = {w1,...,Wjw)} and the rest of the algorithm works under the 
assumption that there is a solution compatible with this ordering. There are 
|W! possibilities for the ordering of W, thus the running time of the rest of 
the algorithm has to be multiplied by this factor. 


Every directed cycle contains a w; + wj path for some i > j: the cycle 
contains at least one vertex of W and has to go “backwards” at some 
point. Therefore, cutting all such paths (which is a problem similar to 
SKEW EDGE MULTICUT) gives a feedback arc set. 


We build a graph G” the following way. We replace every vertex w; with two 
vertices s;, t;, and an edge (t;, si) between them. We define Ew = {(ti, si) : 
1 <i < |W|}. Then we replace every edge (a,w,) with (a,t;), every edge 
(w;,a) with (s;,a), and every edge (w;, wj) with (s;, tj) (see Fig. [8.9) Let us 
define the SkEw EDGE MULTICUT instance (G’, (s1,t1),.--, (sjwy,tjwy), &)- 
The following two claims establish a connection between the solutions of this 
instance and our problem. 


Claim 8.44. If there is a set S C E(G) of size at most k such that G\ S 
has a topological ordering inducing the order wi, ..., ww on W, then the 
SKEW EDGE MULTICUT instance has a solution. 


Proof. For every edge e € S, there is a corresponding edge e’ of G’; let S’ 
be the set corresponding to S. Suppose that G’ \ S has a directed s; > tj 
path P for some i > j. Path P may go through several edges (tr, sr), but it 
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tı S1 t2 S2 tiw siw] 


Fig. 8.9: Solving DIRECTED FEEDBACK ARC SET COMPRESSION in the proof 
of Lemma [8.43} Every directed cycle in G” has an s; — tj subpath for some 
i> j 


has an sy — t; subpath P’ such that i’ > j’ and P’ has no internal vertex 
of the form tr, sr. If i’ = 7’, then the edges of G corresponding to P’ give 
directed cycle C in G disjoint from S, a contradiction. If i’ > 7’, then there 
is a directed wy — w, path P of G corresponding to P’ that is disjoint from 
S, contradicting the assumption that w; is later than w, in the topological 
ordering of G \ S. 4 


Claim 8.45. Given a solution S of the SKEW EDGE MULTICUT instance, 
one can find a feedback arc set S of size at most k for G. 


Proof. Suppose that the SKEW EDGE MULTICUT instance has a solution S$’ 
of size at most k. For every edge of E(G’) \ Ew, there is a corresponding 
edge of G; let S be the set of at most k edges corresponding to S$” \ Ew. 
Suppose that G \ S has a directed cycle C. This cycle C has to go through 
at least one vertex of W. If C goes through exactly one vertex w;, then the 
edges of G” corresponding to C give a directed t; + s; path disjoint from S$’, 
a contradiction (note that this path is certainly disjoint from Ew). If C goes 
through more than one w;, then C has a directed w; + w; subpath for some 
i > j such that the internal vertices of P are not in W. Then the edges of G” 
corresponding to P give a directed s; — t; path in G’ disjoint from S’ (and 
from Ew), again a contradiction. 4 


Suppose that there is a feedback arc set S C E(G) of size at most k. A 
topological ordering of G \ S induces an ordering on W and, as the algorithm 
tries every ordering of W, eventually this ordering is reached. At this point, 
Claim implies that the constructed SKEW EDGE MULTICUT instance 
has a solution S’ and we can find such a solution using the algorithm of 
Theorem Then Claim [8.45] allows us to find a solution of DIRECTED 
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FEEDBACK ARC SET COMPRESSION. The algorithm tries |W]! orderings of 
W and uses the O(4" - k? - (n + m))-time algorithm of Theorem |8.41} every 
other step can be done in linear time. 














Finally, let us see how iterative compression can be used to solve DIRECTED 
FEEDBACK ARC SET using DIRECTED FEEDBACK ARC SET COMPRESSION. 


Theorem 8.46. There is an O((k + 1)!-4* - k? + n(n + m))-time algorithm 
for DIRECTED FEEDBACK ARC SET on graphs with n vertices and m edges. 


Proof. Let vı, ..., Un be an arbitrary ordering of the vertices of G and let 
Gi = Gi{v1,..., vi }]. For i = 1, 2, ..., n, we compute a feedback arc set S; 
of size at most k for G;. For i = 1, S; = Í is a trivial solution. Suppose now 
that we have already computed S;; then S;,; can be computed as follows. 
Let W; C V(G;) be the head of each edge in S;. Clearly, |W;| < k and G\ W; 
is acyclic (since G \ S; is acyclic). Moreover, W; U {viii} has size at most 
k+1 and Gi+ı \ (WiU {v;41}) = G; \ Wi, hence it is also acyclic. Therefore, 
we may invoke the algorithm of Lemma [8.43] to find a feedback arc set Sj41 
of size at most k for G41. If the algorithm answers that there is no feedback 
arc set of size at most k for Gi+1, then there is no such feedback arc set 
for the supergraph G of Gi+ı and hence we can return a negative answer 
for DIRECTED FEEDBACK ARC SET on (G,k). Otherwise, if the algorithm 
returns a feedback arc set Si+ı for Gj, then we can continue the iteration 
with i+1. 

As |W;| < k + 1 in each call of the DIRECTED FEEDBACK ARC SET 
COMPRESSION algorithm of Lemma [8.43] its running time is O((k + 1)!-4*- 
k3-(n-+m)). Taking into account that we call this algorithm at most n times, 
the claimed running time follows. 














Using the simple reduction from the vertex version to the edge version 
(Proposition |8.42), we obtain the fixed-parameter tractability of DIRECTED 
FEEDBACK VERTEX SET. 


Corollary 8.47. There is an O((k+1)!-4*-k?-n(n+m))-time algorithm for 
DIRECTED FEEDBACK VERTEX SET on graphs with n vertices and m edges. 


8.7 Vertex-deletion problems 


All the problems treated so far in the chapter were defined in terms of remov- 
ing edges of a graph (with the exception of DIRECTED FEEDBACK VERTEX 
SET, which was quickly reduced to DIRECTED FEEDBACK ARC SET). In this 
section, we introduce the definitions needed for vertex-removal problems and 
state (without proofs) the analogues of the edge-results we have seen in the 
previous sections. 
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A technical issue specific to vertex-removal problems is that we have to 
define whether a vertex set separating X and Y is allowed to contain a vertex 
of XUY. If so, then X and Y need not be disjoint (but then the separator has 
to contain X NY). In a similar way, we have to define whether, say, VERTEX 
MuttTIway CurT allows the deletion of the terminals. Note that in an edge- 
deletion problem we can typically make an edge “undeletable” by replacing it 
with k+1 parallel edges. Similarly, one may try to make a vertex undeletable 
by replacing it with a clique of size k + 1 (with the same neighborhood), but 
this can be notationally inconvenient, especially when terminals are present. 
in the graph. Therefore, we state the definitions and the results in a way that 
we assume that the (directed or undirected) graph G is equipped with a set 
V~(G) C V(G) of undeletable vertices. Given two (not necessarily disjoint) 
sets X,Y C V(G) of vertices, an (X,Y )-separator is a set S C V(G)\V™(G) 
of vertices such that G \ S has no (X \ S) — (Y \ S) path. We define minimal 
and minimum (X, Y)-separators the usual way. We can characterize minimal 
(X, Y)-separators as the neighborhood of a set of vertices. 


Proposition 8.48. If S is a minimal (X,Y)-separator in G, then S = 
N@(R) (or S = N(R) in directed graphs), where R is the set of vertices 
reachable from X \ S in G \ 5S. 


The results of Section [8.1|can be adapted to (X, Y)-separators; we omit the 
details. Important (X,Y )-separators are defined analogously to important 
(X, Y)-cuts. 


Definition 8.49 (Important separator). Let G be a directed or undi- 
rected graph and let X,Y C V(G) be two sets of vertices. Let S C 
V(G)\V™(G) be an (X, Y)-separator and let R be the set of vertices reach- 
able from X\S in G\S. We say that S is an important (X,Y )-separator if it is 
inclusion-wise minimal and there is no (X, Y)-separator S’ C V(G) \ V™(G) 
with |S’| < |S] such that R C R’, where R’ is the set of vertices reachable 
from X \ S’ in G\ S. 


The analogue of Proposition holds for (X, Y )-separators. 


Proposition 8.50. Let G be a directed or undirected graph and X,Y C V(G) 
be two sets of vertices. Let S C V(G) \ V%°(G) be an (X,Y)-separator and 
let R be the set of vertices reachable from X \ S in G\ S. Then there is an 
important (X,Y)-separator S’ = Ng(R’) (or S! = N(R’) if G is directed) 
such that |S"| < |S| and RC R’. 


The algorithm for enumerating important (X, Y)-cuts and the combinatorial 
bounds on their number can be adapted to important (X,Y )-separators. 


Theorem 8.51. Let X,Y C V(G) be two sets of vertices in a (directed or 
undirected) graph G with n vertices and m edges, let k > 0 be an integer, and 
let Sp be the set of all (X,Y)-important separators of size at most k. Then 
|Sz| < 4" and Sp can be constructed in time O(|Sp| -k? «(n+ m)). 
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Lemma 8.52. Let G be a (directed or undirected) graph and let X,Y C V(G) 
be two sets of vertices. If S is the set of all important (X,Y )-separators, then 


ae ea oe 


Let G be a graph and T C V(G) be a set of terminals. A verter multiway 
cut is a set S C V(G) \ V®(G) of vertices such that every component of 
G \ S contains at most one vertex of T \ S. Given an undirected graph G, 
an integer k, and a set T C V(G) of terminals, the VERTEX MULTIWAY CUT 
problem asks for vertex multiway cut of size at most k. The analogue of the 
pushing lemma for EDGE MULTIWAY CUT (Lemma 8.18) can be adapted to 
the vertex-deletion case. 


Lemma 8.53 (Pushing lemma for VERTEX MULTIWAY CUT). Lett €T 


be a terminal in an undirected graph G. If G has a vertex multiway cut S C 
V(G) \V™(G), then it also has a verter multiway cut S* C V(G) \ V°(G) 
with |S*| < |S| such that S* contains an important (t,T \ t)-separator. 


Theorem Lemma|8.52| and Lemma allow us to solve VERTEX MUL- 
TIWAY CUT using a branching strategy identical to the one used in Theo- 
rem |8.19 


Theorem 8.54. VERTEX MULTIWAY CUT on a graph with n vertices and m 
edges can be solved in time O(4* - kè - (n + m)). 


The result of Theorem was actually proved in the literature for the (more 
general) vertex version, hence VERTEX MULTICUT is also fixed-parameter 
tractable parameterized by k. 


Theorem 8.55 ([59), ). VERTEX MULTICUT on an n-vertex graph can 


be solved in time 22%”) . nOQ), 


Exercises 
8.1. Given an undirected graph G and S C V(G), let ig(S) be number of edges induced 
by S (that is, i¢(S) = |E(G[S])|). Is ig submodular? 
8.2. Prove that a set function f : 21!V(@! — R is submodular if and only if 
F(AU {v}) — F(A) > F(B uU {v}) — f(B) (8.3) 


holds for every A C B C V(G) and v € V(G). Informally, inequality (8.3) says that the 
marginal value of v with respect to the superset B (that is, the increase of value if we 
extend B with v) cannot be larger than with respect to a subset A. 


8.3. Let f : 2I1V(©®)| — R be a submodular function that is symmetric: f(X) = f(V(G)\X) 
for every X C V(G). Show that f is posimodular: 


F(A) + f(B) > F(A \ B) + F(B\ A) (8.4) 


holds for every A, B C V(G). 
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8.4. Give an example of a graph G and sets A,B C V(G) for which the submodularity 
inequality is sharp. 


8.5. Give a polynomial-time algorithm for finding the sets Rmin and Rmax defined in 
Theorem|8.4Jusing only the facts that such sets exists and that a minimum (X, Y )-cut can 
be found in polynomial time. 


8.6. Let G be an undirected graph and let A,B C V(G) be two disjoint sets of vertices. 
Let A(RAP) and A(RAB.) be the minimum (A, B)-cuts defined by Theorem and let 


A(RB4) and A(RB4A.) be the minimum (B, A)-cuts defined by Theorem reversing 


min max 


the role of A and B). Show that RAB = V(G) \ RBA. and RAB. = V(G)\ RBA 


min max max min’ 






8.7. Prove Proposition [8.9] 


8.8. Give an example where the algorithm of Theorem|8.11}finds an (X, Y)-cut that is not 
an important (X,Y)-cut (and hence throws it away in the filtering phase). Can such a cut 
come from the first recursive branch of the algorithm? Can it come from the second? 


8.9. Let G be an undirected graph, let X,Y C V(G) be two disjoint sets of vertices, and 
let A be the minimum (X, Y)-cut size. Let S be the set of all important (X, Y)-cuts. Prove 
that X ges 47/5! < 27> holds. 


8.10. Prove Theorem [8.16 

8.11. Show that Epee Muttiway Cur is polynomial-time solvable on trees. 
8.12 (2). Show that Eper Mutricut is NP-hard on trees. 

8.13. Give a 2% -nO)-time algorithm for Enar Muuricut on trees. 


8.14. Reduce DIRECTED EDGE MuLTICUT with £ = 2 to DIRECTED EpGE Mu.Ltiway Cut 
with |T| = 2. 


8.15. Reduce Epee Muttiway Cut with |T| = 3 to DireEcTED Epee Muttiway Cut 
with |T| = 2. 


8.16. Reduce DIRECTED FEEDBACK ARC SET to DIRECTED FEEDBACK VERTEX SET in 
polynomial time. 


8.17. Show that VERTEX MULTICUT is polynomial-time solvable on trees. 


8.18 (%). In the Dicrapu Pair Curt, the input consists of a directed graph G, a desig- 
nated vertex s € V (G), a family of pairs of vertices F C (re); and an integer k; the goal 
is to find a set X of at most k edges of G, such that for each pair {u,v} € F, either u or 
v is not reachable from s in the graph G — X. Show an algorithm solving DIGRAPH PAIR 
Cur in time 2*n©() for an n-vertex graph G. 


8.19. Consider the following vertex-deletion variant of DigRapH Parr Cur with a sink: 
given a directed graph G, designated vertices s,t € V(G), a family of pairs of vertices 
F C (VY), and an integer k, check if there exists an (s, t)-separator X C V(G) \ {s,t} of 
size at most k such that every pair in F either contains a vertex of X or contains a vertex 
that is unreachable from s in G — X. 

Show how to solve this vertex-deletion variant in time 2*n°@ for an n-vertex graph 
G, using the algorithm of the previous exercise. 
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Hints 


The function ig is not submodular, but supermodular: the left-hand side of (8.1) is 
always at most the right-hand side. 


To see that pli a necessary condition for f being submodular, observe that it is the 
rearrangement of (8.1) applied for AU {v} and B. To see that it is a necessary condition, 
let us build X UY from X NY by first adding the elements of X \ Y one by one and then 
adding the elements of Y \ X one by one. Let us compare the total marginal increase of 
adding these elements to the total marginal increase of building X from X NY, plus the 
total marginal increase of building Y from X NY. The submodularity follows by observing 
that the marginal increase of adding elements of Y \ X cannot be larger if X \ Y is already 
present in the set. 


By submodularity and symmetry, we have 


f(A) + F(B) = F(A) + F(V(G) \ B) 
2 F(AN (V (G) \ B)) + F(A U (V(G) \ B)) 
( 
( 


A\ B) + f(V(G) \ (B\ A)) 
A\ B) + f(B\ A). 


Analyze the proof of (8-1): which edges are counted different number of times in the 
sides of this inequality? Any example with an edge between A \ B and B \ A will do the 
job. 


If A(R) is a minimum (A, B)-cut, then A(R) = A(V (G) \ R) and it is also a minimum 
(B, A)-cut. The equalities then follow simply from the fact that minimizing R is the same 
as maximizing V(G) \ R. 


f 
f )4 
f ) +4 





It is possible that the first branch gives an important (X, Y )-cut S in G\ zy such that 
SU {xy} is not an important (X,Y)-cut in G. For example, let X = {x}, Y = {y}, and 
suppose that graph G has the edges xa, xb, ab, bc, ay, by, cy. Then Rmax = {x}. In graph 
G\zb, the set Ag zo({x, a}) = {ab, ay} is an important (X, Y )-cut. However, Ag({z,a}) = 
{ay, ab, xb} is not an important (X,Y )-cut in G, as Ag({z, a, b,c}) = {ay, by, cy} has the 
same size. 

One can show that every (X,Y)-cut returned by the second branch is an important 
(X, Y)-cut in G. 


We need to show that in each of the two branches of the algorithm in Theorem [8.11] 
the total contributions of the enumerated separators to the sum is at most 2~*/2. In the 
first branch, this is true because we augment each cut with a new edge. In the second 
branch, this is true because strictly increases. 


The proof has the following main arguments: 


1. The enumeration algorithm should be of the form Enumerate(G, X,Y, Z, k), which, 
assuming Z C Ag(X), returns every important (X, Y)-cut of size at most k containing 


Z. 
2. As in the proof of Theorem we compute Rmax in G \ Z and branch on an edge 
zy € Ag(X). 


3. In the first branch, we include zy into Z. We recurse only if no edge of Z has both end- 
points in Rmax and the algorithm of Lemmalg.15]predicts that Enumerate(G, Rmax, Y, 
Z U {xy}, k) returns at least one important (X, Y)-cut. 

4. In the second branch, we check if no edge of Z has both endpoints in Rmax U {y} and 
recurse if Lemma|8.15]predicts that Enumerate(G, Rmax U {y}, Y, Z, k) returns at least 
one important (X, Y )-cut. 
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5. Observe that now every recursive call returns at least one important (X, Y )-cut, hence 
the number of leaves of the recursion tree can be bounded from above by the number 
of important (X, Y)-cuts returned. 


Solution 1: Dynamic programming. Assume that the tree is rooted and let T, be 
the subtree rooted at v. Let A, be the minimum cost of separating the terminals in Ty, 
and let By, be the minimum cost with the additional constraint that v is separated from 
every terminal in T,,. Give a recurrence relation for computing A, and By, if these values 
are known for every child of v. 

Solution 2: We may assume that every leaf of the tree contains a terminal, otherwise the 
leaf can be removed without changing the problem. Select any leaf and argue that there is 
a solution containing the edge incident to this leaf. 


Reduction from VERTEX Cover: the graph is a star, edges play the role of vertices, 
and the terminal pairs play the role of edges. 


Assume that the tree is rooted and select a pair (s;, ti) such that the topmost vertex 
v of the unique s; —t; path P has maximum distance from the root. Argue that there is 
a solution containing an edge of P incident to v. 


Introduce two terminals v1, v2, and add the edges (v1, 1), (t1, v2), (v2, 82), (t2, v1). 


Subdivide each edge and replace each original vertex with an independent set of size 
k+1. 


Assume that the tree is rooted and select a pair (s;,¢;) such that the topmost vertex 
v of the unique s; — t; path P has maximum distance from the root. Argue that there is 
a solution containing vertex v. 


8.18] The main observation is the following: if we choose a set of vertices F C |J F such 
that no vertex of v is reachable from s in the graph G—X (for a solution X we are looking 
for), then we may greedily choose an (s, F)-cut of minimum possible size that minimizes 
the set of vertices reachable from s. 

In other words, we consider the following algorithm. Start with F = Ø, and repeatedly 
perform the following procedure. Compute a minimum (s, F)-cut X = A*(R) that min- 
imizes the set R. If X is a solution, then report X. If |X| > k, then report that there is 
no solution in this branch. Otherwise, pick a pair {u,v} € F such that both u and v are 
reachable from s in G — X, and branch: include either u or v into F. Prove that the size 
of X strictly increases at each step, yielding the promised bound on the size of the search 
tree. 


8.19] Perform the standard reduction from directed edge-deletion problems to vertex- 
deletion ones, by going to the (appropriately adjusted) line graph of the input graph. To 
handle the sink t, replace t with its k + 2 copies t1,t2,...,t,42, and add a pair {t;, tj} to 
F for every 1<i<j<k+2. 


Bibliographic notes 


There is a large body of literature on polynomial-time algorithms for minimum cut prob- 
lems, maximum flow problems, and their relation to submodularity. The monumental work 
of Schrijver and the recent monograph of Frank give detailed overviews of the 
subject. The Ford-Fulkerson algorithm for finding maximum flows (which is the only max- 
imum flow algorithm we needed in this chapter) was published in 1956 [2I2]. For other, 
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more efficient, algorithms for finding maximum flows, see any standard algorithm textbook 
such as Cormen, Leiserson, Rivest, Stein [97]. 

Marx defined the notion of important separators and proved the fixed-parameter 
tractability of VERTEX Muttiway Cur using the branching approach of Theorem [8.19] 
The 4* bound and the proof of Theorem is implicit in the work of Chen, Liu, and 
Lu [84]. Using different approaches, one can obtain better than 4E . nC) time algorithms: 
Xiao gave a 2” . n?0)-time algorithm for Epee MurTIway Cut; Cao, Chen, and 
Fan gave a 1.84". n°) time algorithm for Enae Muttiway Cut; and Cygan, Pilipczuk, 
Pilipczuk, and Wojtaszczyk gave a 2*.n©() time algorithm for VERTEX MULTIWAY 
Cut. 

The NP-hardness of Epee MuLticuT on trees was observed by Garg, Vazirani, and 
Yannakakis [225]. The 2*-n°) algorithm of Exercise[8.13}is by Guo and Niedermeier [242]. 
The polynomial-time algorithm for solving VERTEX MutticutT on trees (Exercise|8.17) was 
described first by Calinescu, Fernandes, and Reed [69]. 

The NP-hardness of Epee Muttiway Cut with |T| = 3 was proved by Dalhaus, 
Johnson, Papadimitriou, Seymour, and Yannakakis [124]. Interestingly, for a fixed number 
£ of terminals, they also show that the problem can be solved in time nO time on planar 
graphs. The running time for planar graphs was improved to 20™/ nO(ve) by Klein and 
Marx [295]. 

The fixed-parameter tractability of EDGE Mutticut and VERTEX MULTICUT param- 
eterized by the solution size k was proved independently by Bousquet, Daligault, and 
Thomassé and Marx and Razgon [357|. The paper of Marx and Razgon introduced 
the random sampling of important separators technique. This technique was used then by 
Lokshtanov and Marx to prove the fixed-parameter tractability of (p,q)-CLUSTER 
parameterized by q. Lokshtanov and Marx proved the problem is FPT also with parame- 
ter p, and studied other variants of the problem, such as when the requirement |C| < p is 
replaced by requiring that the graph induced by C have at most p nonedges. This variant 
was recently used to analyze modular structure of large networks in nature [60]. 

Uncrossing (variants of) submodular functions in different ways is a standard technique 
of combinatorial optimization. The variant described in Section [8.4] using posimodularity 
to uncross sets by replacing them with subsets, was used by, for example, Chekuri and Ene 
for submodular partition and multiway cut problems [75} [74]. 

The random sampling of important separators technique was used by Lokshtanov and 
Ramanujan to solve Parity Muttiway Cut and by Chitnis, Egri, and Marx 
to solve a certain list homomorphism problem with vertex removals. The technique was 
generalized to directed graphs by Chitnis, Hajiaghayi, and Marx to show the fixed- 
parameter tractability of DIRECTED Epee Muttiway Cut and DIRECTED VERTEX MUL- 
TIway CUT parameterized by the size of the solution. The directed version was used by 
Kratsch, Pilipczuk, Pilipczuk, and Wahlström to prove that Epee Mutticut is FPT 
parameterized by k and £ on directed acyclic graphs and by Chitnis, Cygan, Hajiaghayi, 
and Marx to prove that DIRECTED SUBSET FEEDBACK VERTEX SET is FPT. 


The fixed-parameter tractability of Skew EDGE Mutticut, DIRECTED FEEDBACK 
VERTEX SET, and DIRECTED FEEDBACK ARC SET were shown by Chen, Liu, Lu, 
O’Sullivan, and Razgon [85). 


Chapter 9 
Advanced kernelization algorithms 


The systematic study of the kernelization framework, 
whose foretaste we had in Chapter [3 revealed an in- 
trinsic mathematical richness of this notion. In par- 
ticular, many classic techniques turn out to be very 
useful in this context; examples include tools of combi- 
natorial optimization, linear algebra, probabilistic ar- 
guments, or results of the graph minors theory. In this 
chapter, we provide an overview of some of the most 
interesting examples of more advanced kernelization 
algorithms. In particular, we provide a quadratic ker- 
nel for the FEEDBACK VERTEX SET problem. We also 
discuss the topics of above guarantee parameteriza- 
tions in the context of kernelization, of kernelization 
on planar graphs, and of so-called Turing kerneliza- 
tion. 





Recall that a kernelization algorithm, given an instance (x,k) of some 
parameterized problem, runs in polynomial time and outputs an equivalent 
instance (a’,k’) of the same problem such that |a’|,k’ < f(k), for some 
function f. This function f is called the size of the kernel. The smaller the 
kernel is, the better; that is, we would like to have f(k) as slowly-growing as 
possible. Usually, the goal is to obtain f that is polynomial or even linear. 

As discussed in Chapter [2] from the practical point of view, parameterized 
complexity gives us a framework to rigorously analyze various preprocessing 
algorithms, present in the literature for decades. From the theoretical point 
of view, studying kernelization has two main objectives. First, the existence 
of a kernel for a decidable parameterized problem is equivalent to its fixed- 
parameter tractability, and kernelization can be seen as one of the ways to 
prove that a studied problem is FPT. For instance, for the EDGE CLIQUE 
COVER problem of Chapter[2|(Section 2.2) the simple kernelization algorithm 
is in fact the only known approach from the parameterized perspective. Sec- 
ond, the pursuit of as small a kernel size as possible — in particular, the 
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study of which problems admit polynomial kernels, and which (most likely) 
do not — leads to a better understanding and deeper insight into the class 
of FPT problems. Whereas Chapter [2]and this chapter are devoted to prov- 
ing positive results, we give the methodology for proving lower bounds for 
kernelization in Chapter [15] 

We start this chapter with a beautiful example of how classic tools of 
advanced matching theory can be used to obtain a kernel for the FEEDBACK 
VERTEX SET problem. We perform a deep analysis of the role of high-degree 
vertices in a FEEDBACK VERTEX SET instance, obtaining a reduction that 
bounds the maximum degree of the input graph linearly in the parameter. 
This implies an O(k?) bound on the number of vertices and edges in a yes- 
instance, yielding the kernel bound. As we will see in Chapter under 
plausible complexity assumptions, this kernel is essentially tight with respect 
to the bitsize, i.e., the number of bits in the binary representation. 

In Section we discuss the topic of above guarantee parameteriza- 
tions in the context of kernelization, on the example of MAx-Er-SAT. In 
Chapter [3] we have seen an example of above guarantee parameterizations of 
VERTEX COVER, namely VERTEX COVER ABOVE LP and VERTEX COVER 
ABOVE MATCHING. In the MAx-Er-SAT problem, we are given an r-CNF 
formula y, where every clause contains exactly r literals with pairwise dif- 
ferent variables, together with an integer k, and we ask for an assignment 
satisfying at least k clauses. It is easy to see that a random assignment sat- 
isfies m(1 — 27”) clauses on average, where m is the number of clauses in 
y. Hence, we expect k to be rather large in interesting instances, and pa- 
rameterization by k does not make much sense. Instead, we study the above 
guarantee parameterization of k — m(1 — 27"). 

For the MAx-Er-SAT problem, we show how a combination of an alge- 
braic approach and probabilistic anti-concentration inequalities leads to a 
polynomial kernel. The main idea can be summarized as follows: we express 
the input formula as a multivariate polynomial, and argue that if the poly- 
nomial has too many monomials, then some moment inequalities imply that 
a random assignment satisfies the required number of clauses with positive 
probability. Consequently, in this case we can infer that we are dealing with 
a yes-instance. On the other hand, if the polynomial has a bounded number 
of monomials, the polynomial itself is already a “concise” description of the 
instance at hand. 

In Section [9.3] we briefly touch the tip of the iceberg: the field of kerneliza- 
tion in sparse graph classes. In Section[7-7] we have seen that some topological 
sparsity assumption on the input graph, such as planarity, greatly helps in 
designing FPT algorithms: for example, DOMINATING SET, which is unlikely 
to be FPT in general graphs (see Chapter [13), admits even a subexponential 
algorithm on planar graphs. In this chapter, we show how planarity con- 
straints can be exploited to give a linear kernel for the CONNECTED VERTEX 
COVER problem in planar graphs. This should be contrasted with the fact 
that the same problem most probably does not admit a polynomial kernel 
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in general graphs, as we will learn in Chapter (see Exercise point 
3). A similar behavior can be observed for many other problems. Note also 
that in Section [7.7] we have shown a subexponential FPT algorithm for CON- 
NECTED VERTEX COVER in planar graphs, whereas for general graphs only 
single-exponential algorithms are known (see Exercise [6.6). 

Finally, in Section [9.4] we discuss a relaxed variant of kernelization, called 
Turing kernelization. Recall that a preprocessing algorithm aims at reducing 
the time-consuming computation to as small an instance as possible. In the 
classic kernelization framework we produce a single small equivalent instance; 
here, we relax the condition of having a single instance by allowing the al- 
gorithm to solve for free (i.e., in an oracle fashion) instances of small size. 
In particular, we can return a set of small instances together with a recipe 
on how to obtain a solution for the original instance based on solutions for 
the generated instances; note that such an algorithm allows us, for example, 
to solve the output instances in parallel on multiple machines. However, in 
principle we do not need to limit ourselves to such an approach: adaptive 
algorithms, whose next steps depend on an oracle answer to some previous 
query, are also allowed. The working example for this approach will be the 
MAX LEAF SUBTREE problem, where the task is to determine whether there 
exists a subtree of a given graph that has at least k leaves. As we shall see, 
this problem separates classic kernelization from Turing kernelization: while 
it admits a polynomial Turing kernel, no classic polynomial kernel can be 
expected under plausible complexity assumptions (see Exercise [15.4] point 


2). 


9.1 A quadratic kernel for FEEDBACK VERTEX SET 


Recall that a feedback vertex set of a graph G is a set of vertices X C V(G) 
that hits all the cycles of G; equivalently, G \ X is a forest. In the FEEDBACK 
VERTEX SET problem we look for a feedback vertex set of a given graph G 
whose size does not exceed the given budget k, where k is the parameter. 

As in Section [3.3] it is convenient to consider FEEDBACK VERTEX SET on 
multigraphs, where the input graph G may contain multiple edges and loops. 
We treat double edges and loops as cycles. We also use the convention that 
a loop at a vertex v contributes 2 to the degree of v. 

Let us remind the reader that in Section we introduced five reduction 
rules for FEEDBACK VERTEX SET theca VS TIVES) such that, after 
exhaustively applying these rules, either we immediately conclude that we are 
dealing with a no-instance, or the reduced graph G contains no loops, only 
single and double edges, and has minimum degree at least 3. The behavior 
of these rules was as follows: 


e Reduction |FVS.1| remove a vertex with a loop and decrease the budget; 
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e Reduction [FVS.2} reduce multiplicity of an edge with multiplicity more 
than 2 to exactly 2; 

e Reduction [FVS.3} remove a vertex of degree 0 or 1; 

e Reduction contract a vertex of degree 2 onto one of its neighbors; 

° iara AEE if k < 0, then conclude that we are dealing with a 
no-instance. 


All these rules are trivially applicable in polynomial time. Thus, in the rest of 
this section we assume that the input graph G contains no loops, only single 
and double edges, and has minimum degree at least 3. Our goal is to develop 
further, more involved reduction rules than the one described above, aiming 
at a polynomial kernel for FEEDBACK VERTEX SET. 


We now focus on a vertex of large degree in G. It seems reasonable to 
suspect that it should be possible to reduce such vertices: they tend to par- 
ticipate in many cycles in G, and hence they are likely to be included in 
a solution X. If this is not the case, then there is some hope that only a 
small part of the neighborhood of such a vertex actually plays some “signif- 
icant role” in the FEEDBACK VERTEX SET problem, and consequently this 
neighborhood can be simplified. 

To strengthen the motivation for this approach, let us observe that an 
upper bound on the maximum degree actually yields immediately the desired 
kernel. 


Lemma 9.1. If a graph G has minimum degree at least 3, mazimum degree 
at most d, and a feedback vertex set of size at most k, then it has less than 
(d+ 1)k vertices and less than 2dk edges. 


Proof. Let X be a feedback vertex set of G of size at most k, and let Y = 
V (G) \ X. Denote by F the set of edges of G with one endpoint in X and the 
other in Y. We estimate the size of F from two perspectives — first using 
the set X, and second using the set Y. 

First, by the bound on the maximum degree we have that |F| < d|X]. 
Second, since G \ X = G[Y] is a forest, G[Y] contains less than |Y | edges; in 
other words, G[Y] has average degree lower than 2. Therefore, 


X da(v) = X INON XI A NW) NY] < |F| + 2Y]. 
vEY vEY vEY 
On the other hand, the minimum degree of G is at least 3. Thus, 
Š da(v) > 3l¥1. 
vEY 


Consequently, we obtain |Y | < |F| < d| X| and hence 


|V(G)| = |X| + |¥| < (d+ 1)|X| < (d+ 1)k. 
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To estimate the number of edges of G, we again use the bound on the 
maximum degree: there are at most d|X| edges incident to X, and all other 
edges of G are contained in G[Y]. As G[Y] is a forest, it has less than |Y] 
edges. Consequently, 


|E(G)| < d|X|+|Y| < 2d|X| < 2dk. 














Formally, Lemma [9.1] supports the following reduction rule. 


Reduction FVS.6. If |V(G)| > (d+ 1)k or |E(G)| > 2dk, where d is the 
maximum degree of G, then terminate the algorithm and return that (G, k) 
is a no-instance. 


In particular, if the maximum degree of G is less than 7k, then we have 
|V(G)| < 7k? and |E(G)| < 14k?, or Reduction [FVS.6]triggers. Therefore, in 
the rest of this section, we pick a vertex v € V(G) of degree d > 7k and we 
aim either (a) to conclude that v needs to be included in any solution to the 
instance (G,k), or (b) to simplify the neighborhood of v. 

What structure in G will certify that v needs to be included in any solution 
to (G,k)? If we think of FEEDBACK VERTEX SET as a hitting problem the 
following answer is natural: a family of k+1 cycles C1, C2,...,Ch41 in G, all 
passing through v, such that no two cycles C; share any vertex except for v. 
We will call such a structure a flower, with cycles C; being petals and vertex 
v being the core. (Observe that these terms are almost consistent with the 
notion of sunflower defined in Section [2.6] the sets V(C1),...,V(Cx41) form 
a sunflower with & petals and core {v}.) 

Clearly, a flower with k + 1 petals implies that v is in every solution of 
size at most k, as otherwise the solution would need to contain at least one 
vertex of C; \ {v} for every 1 < i < k + 1. Therefore, if there is such a 
flower, then we can simplify the instance by deleting vertex v. If there is no 
such flower, then we would like to use this fact to obtain some structural 
information about the graph that allows further reduction. But how can we 
find flowers with a certain number of petals and what can we deduce from the 
nonexistence of flowers? It turns out that finding flowers can be formulated 
as a disjoint paths problem between neighbors of v, which can be reduced to 
maximum matching (in a not necessarily bipartite graph). Then we can use 
classic matching algorithms to find the maximum number of petals and, if we 
do not find k + 1 petals, we can use characterization theorems for maximum 
matching to deduce structural information about the graph. 

The following theorem of Gallai (see also Fig. [9.1] for an illustration) for- 
mulates the tool that we need; the next section is devoted to its proof. For 
brevity, given a set T C V(G), by a T-path we mean a path of positive length 
with both endpoints in T. 


Theorem 9.2 (Gallai). Given a simple graph G, a set T C V(G), and an 
integer s, one can in polynomial time either 
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Fig. 9.1: Two possible outcomes of Gallai’s theorem: either a large family of 
vertex-disjoint T-paths (on the left) or a small hitting set B for all T-paths 
(on the right). Vertices of T are depicted as squares, whereas all other vertices 
of G are circles 


1. find a family of s+ 1 pairwise vertex-disjoint T-paths, or 

2. conclude that no such family exists and, moreover, find a set B of at most 
2s vertices, such that in G\ B no connected component contains more 
than one vertex of T. 


9.1.1 Proof of Gallai’s theorem 


The main step in the proof of Theorem [9.2] is to translate the question into 
the language of matching theory. Consider a graph H constructed from G by 
duplicating each vertex of V(G) \ T and connecting the copies by an edge 
(see Fig. (9.2). Formally, 


V(A) =TU{e',2? : cE V(G)\T}, 

E(H) = E(G|T]) U {uzx', ua? : ue T,2 € V(G)\T, uz € E(G)} 
Uf{aly! aly, a7y',2?y? : x,y € V(G)\T, ay € E(G)} 
Uf{ata? : 2 €V(G)\T}. 


The crucial observation is the following. 


Lemma 9.3. For any nonnegative integer s, there exist s pairwise vertes- 
disjoint T-paths in G if and only if there exists a matching in H of size 
s+ |V(G)\ Tl. 


Proof. Let P be a family of pairwise vertex-disjoint T-paths in G. With- 
out loss of generality, we may assume that the internal vertices of the 
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Fig. 9.2: Construction of the graph H. The matching Mo is depicted with 
dashed lines 


paths in P are disjoint from T, as otherwise such a path may be short- 
ened. For any P € P, define a path PË in H as follows: if the sequence 


of consecutive vertices of P equals u,71,%2,...,2|pj-1,w where u,w E T, 
|P| denotes the number of edges of P, and z; ¢ T for 1 < i < |P], 
then let PË = U, £1, 27,0, 23,---, Lp) 1,27 pj- W- Moreover, let Mo = 


{a'x? : x € V(G)\T} be a matching of size |V(G) \ T| in H. Observe 
that every path P” has every second edge in the matching Mo. Let us de- 
note E9(P”) := E(P#) A Mo to be the set of edges of Mo on P¥, and 
E,(P") := E(P#) \ Mo to be the set of remaining edges of PH. Note that 
the path P” starts and ends with an edge of E,(P”) and, consequently, 
|Eo(P#)| +1 =|E1(P#)|. Construct a set of edges M as follows: start with 
M := Mo and then, for every P € P, replace Eo(P#) C Mo with E, (P¥). 
(In other words, treat P” as an augmenting path for the matching Mb.) 
Since the paths of P are pairwise vertex-disjoint, the sets Eo(P#) for P € P 
are pairwise disjoint and, moreover, M is a matching in H. Finally, note that 
IM] = |Mo| + P| = |V(G)\ TI + |P]. 

In the other direction, let M be a matching in H of size s + |V(G)\ T| = 
s + |Mo|. Consider the symmetric difference MjpAM. Note that the graph 
(V(G), Mo AM) has maximum degree 2, so it consists of paths and cycles 
only, where edges of Mo and M appear on these paths and cycles alternately. 
Observe also that the only connected components of (V(G), Mj) AM) that 
contain more edges of M than of Mo are paths that begin and end with an 
edge of M. Since |M|—|Mo| = s, we have at least s such paths, and they are 
pairwise vertex-disjoint. Consider one such path Q. By the definition of Mo, 
path Q starts and ends in a vertex of T and no internal vertex of Q lies in T. 
Consider a path QC being a “collapsed” version of Q in the graph G, defined 
as follows: the endpoints of Q© are the same as those of Q, while whenever 
the path Q traverses an edge «tx? € Mo, we add the vertex x € V(G) \T to 
the path QS. It is easy to see that Q© is a T-path in G and, furthermore, 
if we proceed with this construction for at least s vertex-disjoint paths Q in 
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Fig. 9.3: Equivalence of a T-path in G and an augmenting path to the match- 
ing Mo in H 


H, we obtain at least s vertex-disjoint T-paths in G. This finishes the proof 
of the lemma (see also Fig. (9.3). Oo 





Lemma [þ.3] gives us the desired connection between T-paths and matching 
theory. Observe that to compute the maximum number of vertex-disjoint T- 
paths in G, we simply need to compute a maximum matching in H; in fact, 
the proof of Lemma [9.3] gives us a way to efficiently transform a maximum 
matching in H into a family of vertex-disjoint T-paths in G of maximum 
possible cardinality. Consequently, Lemma allows us to check in Theo- 
rem [9.2] whether the requested number of T-paths can be found. However, to 
obtain the last conclusion of Theorem [9.2|— a hitting set B for all T-paths 
— we need to study more deeply possible obstructions for the existence of 
a large number of vertex-disjoint T-paths. By Lemma [9.3 such an obstruc- 
tion should, at the same time, be an obstruction to the existence of a large 
matching in H. Luckily, these obstructions are captured by the well-known 
Tutte-Berge formula. 


Theorem 9.4 (Tutte-Berge formula, see [338]). For any graph H, the 
following holds: 


|V(H)| ~ 2v(H) = max, odd(H — 8) — |S], 


where v(H) denotes the size of a maximum matching in H and odd(H — S) 
denotes the number of connected components of H — S with an odd number 
of vertices. 

Furthermore, a set S for which the maximum on the right-hand side is 
attained can be found in polynomial time. 


We now use Theorem [9.4] to obtain the set B of Theorem while the 
algorithmic statement of Theorem [9.4] will ensure that the set B is efficiently 
computable. 
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For a set X C V(G), by C(X) we denote the family of connected com- 
ponents of G — X. In our proof we study the following quantity, defined for 
every X C V(G): 





eo D [E 


cec(x) 


Fix a set X C V(G) and observe that every T-path in G either contains 
a vertex of X, or is completely contained in one connected component of 
C(X). In a family of pairwise vertex-disjoint T-paths there can be at most 
|X| paths of the first type, while a component C € C(X) can accommodate 
at most ||C N T|/2] pairwise vertex-disjoint T-paths since every such path 
has both endpoints in T. Consequently, G cannot accommodate more than 
E(X) pairwise vertex-disjoint T-paths. 

As an intermediate step to prove Theorem we prove that in fact the 
upper bound given by (X) is tight: 


Lemma 9.5. The maximum possible cardinality of a family of pairwise vertex- 
disjoint T-paths in G equals minx cya) E(X). Moreover, one can in polyno- 
mial time compute a set W C V(G) satisfying €(W) = minxcya) E(X). 


Proof. Recall that, by Lemma]9.3} the maximum possible number of pairwise 
vertex-disjoint T-paths in G equals v(H) —|V(G) \ T|, where we unravel the 
quantity v(H) with Theorem |9.4 

For brevity, for every S C V(H) we define the deficiency of S as 
df(S) = odd(H — S) — |S|. We start with invoking the algorithmic state- 
ment of Theorem [9.4] to compute the set So of maximum deficiency in H, 
that is, with df(Sg) = |V(H)| — 2v(#). 

Assume that, for some x € V(G)\T, the set So contains exactly one vertex 
among the copies z+, x? of x in H; w.l.o.g., let x! € So and x? ¢ So. Consider a 
set Sh = So\ {xt}. The crucial observation is that, as xt and x? are true twins 
in H (that is, they have exactly the same closed neighborhoods), all neighbors 
of x! in H are contained either in Sọ or in the connected component of H — So 
that contains «?. Consequently, the sets of connected components of H — So 
and H—S differ only in one element: the connected component that contains 
x?. We infer that odd(H — S4) > odd(H — So) —1. As |S] = |So|—1, we have 
df(S$) > df(So). By the maximality of So, the set $4 is of maximum deficiency 
as well. Consequently, we may consider Sj instead of So. By repeating this 
process exhaustively, we may assume that for any x € V(G) \ T, either both 
x! and x? belong to So or neither of them does. 

Let WT = {x € V(G)\T : a!,2? € So} be the projection of So onto 
V(G)\T, let WT = So AT, and let W = WT U WT. Note that it suffices 
to prove that there exists a family of €(W) pairwise vertex-disjoint T-paths 
in G; to show this claim we will strongly rely on the fact that W has been 
constructed using a set So of maximum deficiency in H. 
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Recall that for any x € V(G) \ T, either both x! and z? belong to So 
or neither of them does. Consequently, every component of H — So contains 
either both z! and z? or neither of them, which implies that a component of 
H — So has odd size if and only if it contains an odd number of vertices of T. 
Since the connected components of H — So are in one-to-one correspondance 
with the components of C(W), we infer that 





y ICAT] con) = o 
2 7 2 
cec(W) 
Since X` cec(w) |C N T| = |T \ W| and the Tutte-Berge formula implies that 
df (So) = |V(H)| — 2v(H), we obtain the following: 

















eW)=|Wi+ $ em 
Cec(w) 
-W + 5 (IT\ WI — oda(H — 50)) 
= W] + 5 (IT| — [W7] — (|S0| + d£(50))) 
= W] +5 (IT — W7] = (W7] + 217 |) — VED] - 20) 
= |W] + 5 (IT| — 21W] — [VD + 204) 


= v(H) — 5|V(H)\T| = v(Ħ) — IV (G)\ T]. 











This finishes the proof of the lemma. 





Recall that our goal is to construct a small set B that intersects every 
T-path in G. The set W of Lemma [9.5] almost meets our needs, but some 
connected components of G— W may contain more than one vertex of T. We 
fix it in the simplest possible way: we start with B := W computed using 
Lemmaf9.5]and then, for each connected component C of G—W that contains 
at least one vertex of T, we insert into B all vertices of CMT except for one. 
Clearly, by construction, each connected component of G — B contains at 
most one vertex of T. It remains to show the required upper bound on the 
size of B. Observe that: 


ICnT| 
= — < ——— | < 
|B) =|Wl+ X max(0,|CNT|-1) < |W]+2 X | 5 | < 28(W). 
Cec(W) cec(W) 


Consequently, by Lemma|9.5|the cardinality of B is not larger than twice 
the maximum cardinality of a family of pairwise vertex-disjoint T-paths in 
G. This concludes the proof of Theorem |9.2 
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9.1.2 Detecting flowers with Gallai’s theorem 


We would like to apply Gallai’s theorem to detect flowers with the core at 
some v € V(G), in order to efficiently apply the following reduction rule: 


Reduction FVS.7. If there exists a vertex v € V(G) and a flower with core 
v and more than k petals, then delete v and decrease k by 1. 


The safeness of Reduction is immediate. In what follows we argue 
how to apply Reduction [FVS.7]|in polynomial time; a small difficulty is that 
we need to be careful with multiple edges that may be present in G. Fix 
one vertex v € V(G). Observe that multiple edges not incident to v behave 
as single edges from the point of view of flowers with core v: for any such 
multiple edge xy, only one petal of the flower may use one of the copies of 
zy and it does not matter which copy is used. Moreover, any double edge 
incident to v can be greedily taken as a petal to the constructed flower: if 
vx is a double edge in G, then in any flower, at most one petal uses the 
vertex x and it can be replaced with the cycle consisting of both copies of vz. 
Consequently, define D to be the set of vertices x such that vax is a double 
edge in G, and let G be the graph G \ D with all double edges replaced with 
single ones. If |D| > k, then we can immediately trigger Reduction [FVS.7]on 
v. Otherwise, as we have argued, the existence of a flower with s > |D| petals 
and core v in G is equivalent to the existence of a flower with s — |D| petals 
and core v in G. 7 
Since the graph Gis simple, we can apply Theorem|9.2|to the graph G\{v}, 
the set T = Na(v), and the threshold s = k—|D]. Clearly, T-paths in G\ {o} 


are in one-to-one correspondence with cycles in G passing through v. Hence, 
by Theorem [9.2] we either obtain a flower with at least (k — |D| + 1) petals 
and core v in G, or we learn that no such flower exists and we get a set B 
of size at most 2k — 2|D| that intersects all cycles in G passing through v. In 
the first case, together with |D| double edges incident to v we obtain a flower 
with (k + 1) petals, and we may trigger Reduction [FVS.7] on v. 

Otherwise, we not only learn that there is no flower with core v and more 
than k petals, but we also obtain the set B that intersects all cycles through 
v in G. Define Z = BU D. The following lemma summarizes the properties 
of Z. 


Lemma 9.6. If v € V(G) and there does not exist a flower in G with core 
v and more than k petals, then we can in polynomial time compute a set 
Z C V(G)\{v} with the following properties: Z intersects every cycle that 
passes through v in G, |Z| < 2k and there are at most 2k edges incident to v 
and with second endpoint in Z. 


Proof. The bound on the number of edges between v and Z follows from the 
bound |B| < 2k — 2|D| and the fact that for every vertex u € D there are 
exactly two edges uv in G, whereas for every w € B there is at most one edge 
uw. The remaining claims are immediate. 
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Fig. 9.4: The structure of the graph G with the set Z 


9.1.3 Exploiting the blocker 


In the rest of this section, we are going to use the structure given by the set 


Z to derive further reduction rules. 


Let us first give some intuition. Assume that vertex v has degree at least 
ck, for some large constant c (in the proof we use c = 7) and assume 
further Reduction[FVS.7]is not applicable. First, note that double edges 
incident to v contribute at most 2k to its degree. Obtain a set Z from 
Lemmaj9.6|for v and look at the connected components of G'\ ({v}UZ); 
each of them is connected with at most one edge to v, and hence there 
are at least (c — 2)k of them. Moreover, if more than k of them are not 
trees, then we obtain k + 1 pairwise vertex-disjoint cycles in G and we 
can conclude that we are dealing with a no-instance. Hence, in the re- 
maining case at most k of the components are not trees. Consequently, 
we obtain at least (c — 3)k components that have a very simple struc- 
ture: They are trees, they are connected to v by exactly one edge, and 
they can have some neighborhood in the set Z. On the other hand, Z 
is relatively small when compared to the total number of these com- 
ponents. The intuition is that most of these components are actually 
meaningless from the point of view of hitting cycles passing through v, 
and hence can be reduced. 


For the rest of this section, we assume that we work with a vertex v with 
degree d > 7k. Let C be the family of connected components of G \ ({v}U 
Z). We first observe that most of the components of C have to be trees, as 
otherwise we obtain more than k vertex-disjoint cycles in G; see also Fig. 


for an illustration. 
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Reduction FVS.8. If more than k components of C contain cycles (i.e., are 
not trees), terminate the algorithm and return that (G, k) is a no-instance. 


As Z intersects all cycles passing through v, for any C € C at most one edge 
connects v with C. The next important observation is that many components 
of C are connected to v with exactly one edge. 


Lemma 9.7. There are at least 4k components of C that are trees and are 
connected to v with exactly one edge. 


Proof. Recall that the degree of v equals d > 7k and that, by Lemma 
there are at most 2k edges incident to z and a vertex of Z. Thus, there are 
at least 5k components of C that are connected with v with exactly one edge. 
If Reduction [FVS.8]is inapplicable, at most k of them contain cycles. Hence, 
at least 4k of them satisfy the desired properties. 














Let D be the family of components of C that satisfy the requirements of 
Lemma |9.7| that is, for each A € D it holds that G[A] is a tree and there 
exists exactly one edge incident to both v and a vertex of C. 

We now construct an auxiliary bipartite graph H, as follows: one biparti- 
tion class is Z, the second one is D, and an edge connects z € Z with A€ D 
if and only if there exists an edge in G between z and some vertex in A. 
Since Reduction is inapplicable, and G[A] is a tree connected to v 
with only one edge, for any A € D, some vertices of A need to be adjacent to 
Z in G. Consequently, no vertex of H, in the second bipartition class (i.e., 
D) is isolated. Moreover, |Z| < 2k — |D| and |D| > 4k. Hence, we may use 
the g-expansion lemma for q = 2 (Lemma|2.18) to the graph H,, obtaining 
a 2-expansion in H,. More precisely, in polynomial time we can compute a 
nonempty set Z C Z and a family D C D such that: 


1. Ny, (D) = Z, that is, Ne(UD) = ZU {v}; 
2. each z € Z has two private components A!, A2 € D such that z € Ng(A!) 
and z € N@(A?). Here, by private we mean that components A}, A? are 


all different for different z € Z. 
We claim the following. 


Lemma 9.8. For any feedback vertex set X in G that does not contain v, 
there exists a feedback vertex set X’ in G such that |X'| < |X| and ZC X’. 


Proof. Consider a feedback vertex set X in G with v ¢ X. Define 
= (x\U5) UZ. 


First, we show that X’ is a feedback vertex set of G. Let C be any cycle in G. 
If C does not intersect any component of D, then the vertex of X that hits 
C belongs also to X’, by the definition of X’. Suppose then that C intersects 
some A € D. Recall that G[A] is a tree, Ng(A) C ZU {v} and there is exactly 
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one edge connecting A and v in G. Hence, C has to pass through at least one 
vertex of Z, which is included in X’. We conclude that in both cases C is hit 
by X’, and so X’ is a feedback vertex set of G. 

We now bound the size of X’. For any z € Z, consider a cycle C, that 
starts from v, passes through A! to z and goes back to v through A?. The 
family of cycles {C, : z € Z } forms a flower with core v and a petal for each 
z € Z. Since v ¢ X, we infer that X needs to contain at least IZI vertices 
from ZU UD. Consequently, we have |X’| < |X|, and the lemma is proved. 














Lemma [9.3] allows us to greedily choose either v or Z into the solution 
feedback vertex set. If we were to design an FPT algorithm we would simply 
branch; however, in the case of kernelization — which is a polynomial-time 
algorithm — we need to encode such a behavior using some sort of gadgets. 
Note that we need to be careful: we cannot simply erase the components 
of D as, in the case of deletion of v, they may play an important role with 
respect to cycles passing through some vertices of Zz (but not v), and we do 
not properly understand this role. However, what we can do is simplify the 
adjacency between v and DU Z. Recall that, due to Lemma 9.1| it suffices 
if our reduction decreases the degree of a high-degree vertex v. 


Reduction FVS.9. Delete all edges between v and UD, and make v adja- 
cent to each vertex of Z with a double edge. 


Let us now formally verify that Reduction |FVS.9]is safe. Denote by G” the 
graph constructed from G by its application. 


Lemma 9.9. The minimum possible sizes of a feedback vertex set in G and 
in G’ are equal. 


Proof. Let_X be a feedback vertex set in G of minimum possible size. By 
Lemma we may assume that v € X or Z C X. However, we have G\ {v} = 
G! \ {v}, whereas G” \ Z is a subgraph of G \ Z. Hence, in both cases, G” \ X 
is a subgraph of G \ X, and X is a feedback vertex set of G” as well. 

In the second direction, let X be a feedback vertex set in G”. As v is 
connected with every vertex of Z with double edge in G’, we have v € X 
or Z C X. In the first case, as before, G \ {v} = G’ \ {v} and hence X is a 
feedback vertex set in G as well. Suppose then that Z C X. Then any cycle 
in G \ X would need to pass through some edge of E(G) \ E(G’), that is, 
through an edge connecting v with some A € D. However, since G[A] is a tree 
connected to v with only one edge, such a cycle would need to pass through 
some vertex of Z. This is a contradiction with Z C X, and so the lemma is 
proved. 














Note that a priori it is not obvious that Reduction |FVS.9] actually makes 
some simplification of the graph, since it substitutes some set of edges with 
some other set of double edges. Therefore, we need to formally prove that 
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the reduction rules cannot be applied infinitely, or superpolynomially many 
times. We shall do this using a potential method: we define a measure of the 
instance at hand, which (a) is never negative, (b) is initially polynomially 
bounded by the size of the instance, and (c) strictly decreases whenever any 
of the reductions is applied. For an instance (G, k), let E2(G) be the set of 
all the loops and edges of G except for edges of multiplicity 2. We define 


2(G) = 2|V(G)| + |E-2(G)]. 


We now claim that potential ¢ strictly decreases whenever applying some 
reduction rule (of course, providing that the rule did not terminate the algo- 
rithm). This holds trivially for all the reductions except for Reduction [FVS.9] 
and Reduction For Reduction we remove a nonempty set of 
single edges from the graph, thus decreasing |E42(G)|, while the introduced 
double edges are not counted in this summand. For Reduction the 
nontrivial situation is when the reduction contracts one copy of an edge of 
multiplicity 2, and thus we obtain a new loop that is counted in summand 
|E_2(G)|. However, the reduction also removes one vertex of the graph, and so 
the summand 2|V(G)| decreases by 2. Hence the whole potential ¢ decreases 
by at least 1 in this case. 

As each reduction is applicable in polynomial time, the kernelization algo- 
rithm in polynomial time either terminates and concludes that we are dealing 
with a no-instance, or concludes that no rule is applicable. In this case the 
graph G has maximum degree less than 7k, and by the inapplicability of 
Reduction [FVS.6| we have |V(G)| < 7k? and |E(G)| < 14k?. This concludes 
the description of the quadratic kernel for FEEDBACK VERTEX SET. We can 
summarize the result obtained in this section with the following theorem. 


Theorem 9.10. FEEDBACK VERTEX SET admits a kernel with at most 7k? 
vertices and 14k? edges. 


*9.2 Moments and MAx-Er-SAT 


The solution size or the value of an objective function is often a natural 
parameterization. However, some problems are trivially FPT with such a 
measure, as in nontrivial instances the value we are looking for is proportional 
to the input size. 

This is exactly the case for the MAXIMUM SATISFIABILITY problem, where 
given a CNF formula ọ with n variables z1, ..., £n and m clauses C1, ..., Cm, 
we are to find an assignment with the maximum number of satisfied clauses. 
Formally, in the decision version, the instance is additionally equipped with 
an integer k, and we ask for an assignment w that satisfies at least k clauses. 
Observe the following: if we take any assignment 7 and its negation w, then 
any clause C; is satisfied by at least one of these assignments. Consequently, 
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either y% or y satisfies at least m/2 clauses of y, and all MAXIMUM SATISFI- 
ABILITY instances (y,k) with k < m/2 are yes-instances. 

If you recall the main principle of parameterized complexity — do some- 
thing smart when the parameter is small — then, clearly, parameterization 
by & does not make much sense for MAXIMUM SATISFIABILITY: in all inter- 
esting instances, the number of clauses is bounded by 2k. However, one can 
consider also a more refined way to parameterize MAXIMUM SATISFIABILITY, 
namely the so-called above guarantee parameterization. Define k’ = k —m/2 
in a MAXIMUM SATISFIABILITY instance (y, k), and consider MAXIMUM SAT- 
ISFIABILITY parameterized by k’, that is, the excess above the “guaranteed” 
solution value of at least m/2 satisfied clauses. Recall that in Chapter 
we have seen an example of above guarantee parameterizations of VERTEX 
COVER. 

The study of the above guarantee parameterization of MAXIMUM SATIS- 
FIABILITY is deferred to Exercise [9.3] where you are asked to show its fixed- 
parameter tractability. In this section, we consider, as an example, a more 
specific variant of the MAXIMUM SATISFIABILITY problem, namely MAx-Er- 
SAT. Here we assume that the input formula y is an r-CNF formula for 
some positive integer r, that is, each clause of y consists of exactly r liter- 
als. Moreover, we assume that literals in one clause pairwise involve different 
variables, i.e., there are no two identical literals and no literal is the negation 
of another one. Therefore, requiring that each clause consists of exactly r 
literals (as opposed to the more usual requirement of at most r literals) is a 
nontrivial condition: for example, we cannot just repeat one of the literals of 
a clause with less than r literals to increase its size to exactly r. The more 
“standard” version, where every clause contains at most r literals, is called 
MaAx-r-SAT. 

For an r-CNF formula one can show a stronger lower bound on the maxi- 
mum number of satisfied clauses than for a general CNF formula. 


Lemma 9.11. For an r-CNF formula y with m clauses, there exists an as- 
signment satisfying at least m(1 — 27") clauses. 


Proof. Consider a random assignment p, where each variable x; is indepen- 
dently at random assigned true or false value with probability 4 each. A 
clause C; is falsified by w if and only if all its literals evaluate to false with 
respect to ~, which happens with probability exactly 27". By the linearity of 


expectation, the expected number of satisfied clauses of y is m(1 — 27"), as 


E(number of satisfied clauses) = 5 Pr(C; is satisfied) = m(1 — 27"). 


1<i<m 


Therefore, there exists an assignment satisfying at least m(1 — 27") clauses. 














Because of the Lemma we know that it is always possible to satisfy 
at least fraction (1 — 27”) of all the clauses. Following the reasoning for 
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MAXIMUM SATISFIABILITY from the beginning of this section, we consider 
the above guarantee parameterization of the MAx-Er-SAT problem, where 
k' = k—m(1—27") is our new parameter. We shall show that MAx-Er-SAT 
is FPT when parameterized by k’ using the following approach. 


1. We change the setting and reduce the problem to checking whether 
some low-degree polynomial X constructed from the input formula 
y can take value at least 2"k’ when each variable is assigned either 
l or —1. 

2. When we choose the values of variables uniformly and independently 
at random from {—1, 1}, then the mean value of X is 0. The crucial 
step is to prove that if X contains many monomials (in terms of 
k’), then the distribution of its values must be very “spread” around 
the mean. More precisely, we prove an anti-concentration inequality 
showing that X attains value larger or equal to 2"k’ with positive 
probability, providing that it has sufficiently many monomials. 

3. Hence, we can either conclude that we are definitely working with 
a yes-instance, or the polynomial X has only few monomials (in 
terms of k’). Thus, in polynomial time we have compressed the 
given instance to size polynomial in k’, which leads to a polynomial 
kernel and an FPT algorithm. 


9.2.1 Algebraic representation 


We are going to encode the formula y with clauses Cls(y) = {C1, Co,...,Cm} 
and variables Vars(y) = {21,22,...,2n} as a polynomial. For a clause Cj, 
we denote by Vars(C;) the set of variables appearing in literals of C;. Let us 
identify the Boolean value true with the real number 1 and the value false 
with —1. Consequently, an assignment is a function Y: {£1, £2,..., 2n} > 
{—1,1}. The crucial algebraic object in our considerations will be the follow- 
ing polynomial: 


X(@1,%2,..-,2n) = 5 1- II (1+€2,,c,2%) |, (9.1) 


l<i<m a j€Vars(Ci) 


where €,,,¢, = —1 if x; appears in C; positively, while €,,,0o, = 1 if xj 
appears in C; negatively. Observe that if an assignment satisfies the clause 
Ci, then the corresponding product is 0. If we consider r to be a constant, 
then polynomial X can be written explicitly as a sum of monomials in linear 
time: expanding each of the inner products results in at most 2” monomials, 
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and with the additional 1 in each summand we have at most (2” + 1)-m 
monomials in total. As we shall later see, the input instance is trivially a 
yes-instance, unless this number of monomials is much smaller (bounded in 
terms of k’); that is, if most of the summands in the expansion of formula 
(9.1) actually cancel out. 

We now establish a relation between polynomial X and the maximum 
number of clauses that may be satisfied in formula y. In what follows, for an 
assignment p, we use a shorthand X (y) := X (Y(x1), Y(x2),..-, Y(£n))- 


Lemma 9.12. For an assignment p: {x1, ..., £n} > {—1,1}, we have 
X (yp) = 2 - (sat (9,9) — (1-2-7) m), 
where sat (p, Y) denotes the number of clauses of p that w satisfies. 
Proof. Observe that Hacc, (1 + Ex; citi) equals 2” if C is falsified and 0 


otherwise. Hence, 


X(~) =m— 5 [C; not satisfied in 4%] - 2” 
CiECls(ẹ) 
= m — (m — sat (p,)) - 2" 
= 2" . (sat (p, Y) — (1 — 27") m). 














Lemma allows us to reformulate the problem using polynomial X: 
The question whether there is an assignment satisfying at least k = m(1 — 


27") +k’ clauses of ọ is equivalent to asking whether there is an assignment 
WỌ : {£1, En} > {—1, 1} for which X(w) > 27k’. 


Our strategy is to show that if X has sufficiently many monomials, 
then for some ~ we always have X (y) > 27k’. To achieve this goal, 
we consider y as a random vector that picks the value of each variable 
independently and uniformly at random, and we study the first few 
moments of the random variable X (y). The desired anti-concentration 
property will follow from standard probabilistic inequalities relating the 
distribution of a random variable with its moments. 


9.2.2 Tools from probability theory 


We are going to use the following well-known inequality. 


Lemma 9.13 (Hölder’s inequality, probabilistic version). For any re- 
als p,q > 1 satisfying 3 + i = 1 and any real random variables Y and Z the 
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following holds: 
1 1 
E(Y Z|) < (E(IY|?))” - (EQZ]9))* . 


We remark that the special case p = q = 2 is often called the Cauchy- 
Schwarz inequality. 


Lemma 9.14 (Cauchy-Schwarz inequality, probabilistic version). For 
any real random variables Y and Z the following holds: 





E(\Y¥Z|) < VE(IY |?) - E(|Z]?). 
We use Hölder’s inequality to relate the second and the fourth moment of 


a random variable. 


Lemma 9.15. Assume that X is a real random variable with finite and pos- 
itive fourth moment (i.e., 0 < E(X*) < œ). Then 


niw 


(E(x?)) 


E(X) > =. 
(E(x)? 
Proof. Take Y = |X|3, Z = |X|3, p = 3, q = 3. Using Hélder’s inequality 
we obtain P i 

E(X?) < ŒX? - (E(X4)) 5. 











The lemma follows by standard algebraic manipulations. 





The following corollary shows that the only way a zero-mean random vari- 
able X may achieve a high second moment in comparison to its fourth mo- 
ment is to attain high values with positive probability. 


Corollary 9.16. Let X be a random variable and suppose that E(X) = 0, 
E(X?) =o7, and 0 < E(X*) <b-ot. Then 


o 
Pr| X > —- | >0. 
(e35) 
Proof. By Lemma we have E(|X|) > z Since E(X) = 0, by the law 
of total expectation, we have 


Pr(X > 0)-E(X|X > 0)+Pr(X <0)-E(X|X <0)=0 (9.2) 


(note that we do not have to consider the contribution of the term Pr(X = 0)- 
E(X|X =0), as it is obviously zero). Since E(X“) > 0, at least one of 
Pr(X > 0) and Pr(X < 0) has to be nonzero and if one of them is nonzero, 
then E(X) = 0 implies that both of them are nonzero. By we infer 


Pr(X > 0)-E(X|X > 0) = Pr(X < 0)-E(—X|X < 0). (9.3) 


On the other hand, from the law of total expectation applied to |X| we have 
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(|X|) = Pr(X > 0) -E(X|X > 0) + Pr(X <0)-E(—X|X <0). (9.4) 


Combining (9.3) and (9.4) we obtain 


BUX) o 


2 2b 





Pr(X > 0)-E(X|X > 0) = 


we infer that X takes value at least ~2~ with 


Since E(X|X > 0) > I a7 


positive probability. 














9.2.3 Analyzing moments of X (p) 


Let X = X(z1,.., £n) be the polynomial defined in (9.1). As already men- 
tioned, we can represent X as a sum of monomials. More precisely, we apply 
exhaustively the distributivity of addition and multiplication on reals to the 
formula (9.1), and cancel out summands whenever possible. In this way, we 
obtain a representation of X as a sum of monomials X = J jes Xz, where 
sc (H) and for each I € S we have X; = cy -[],<,;%i for some cr F 0. 
Observe that X(0,0,...,0) = 0, so there is no constant term in X; in other 
words, Ø ¢ S. Moreover, X is a multilinear polynomial, that is, it is linear 
with respect to every variable (in the representation as a sum of monomials, 
in every monomial each variable appears with exponent 0 or 1). We remark 
here that, due to cancellation, |S| may be much smaller than the original 
number of summands in the expanded form of (9.1). 

Now we apply the derived inequalities to our polynomial, but in order to 
treat the value of X as a random variable, we first need to formalize the 
probability distribution we have discussed previously. Henceforth, we treat 
X as a random variable defined on the domain {—1, 1}”, where every variable 
x; takes value —1 or 1 uniformly at random (i.e., with probability z each), 
and values of all the variables are independent. As all the monomials in X 
have at least one variable, by the linearity of expectations and independence 
of variables, we have E(X) = 0. Let o? = E(X?) be the second moment of 
X. Now, we give a connection between o? and |S], that is, the number of 
monomials of X. 


Lemma 9.17. o? > |S]. 


Proof. Observe that for any I,J € S, I # J, we have that E(X;X 7) = 
CICJ: B(Tlseray za) = 0, where TAJ is the symmetric difference of J and J. 


Using this, we obtain 
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E(X?) -»(() (zx) =X E(X + J E(X/X)) 


IES IES IES I,JES,IŻJ 
=ġ_ R(X} = 2 = |S}. 
IES IES 











The last inequality holds because every c; is a nonzero integer. 





The remaining prerequisite of Corollary is an upper bound on the 
fourth moment of X, which we prove by induction on n. For this reason, we 
state the following lemma in a general way. 


Lemma 9.18 (Hypercontractive inequality). Let X be a multilinear 
polynomial of degree at most r. Then, E(X*) < 9" (EOe))*. 


Proof. We prove the lemma by induction on n, the number of variables. In 
the base case assume n = 0, which means that X consists only of a constant 
term c (potentially c = 0). Then E(X*) = c* = (E(X?))* and the inequality 
holds. 

In the inductive step we assume that n > 1, and let us express the poly- 
nomial as X = P(z1,...,£n—1)* Un + Q(a1...,2%n—1). Observe that in this 
expression the values of P and Q depend only on the first n— 1 variables, and 
are independent of the value of xn. Consequently, since E(Y Z) = E(Y) E(Z) 
for independent random variables Y and Z, we obtain the following: 


E(X*) = E((Pa, + Q)*) 
= E(Ptrt + 4P322Q + 6P? x? Q? + 4Pr,Q? + Q*) 
= E(P*) E(x’) +4E(P%Q) E(22) +6 E(P?Q?) E(x) 
“ss ~’” —S—" ~ 





=i =0 =1 
+ 4E(PQ*) E(an) + E(Q*) 
=0 


<9"! (E(P))” + 6B(P2Q?) +9" (E(Q?))’ . 


The last inequality comes from the induction hypothesis applied to P and 
Q. Notice that P consists of monomials whose number of variables is at most 
r — 1, while monomials in Q can still have exactly r variables. Using the 
Cauchy-Schwarz inequality, we obtain 


6E(P?Q?) <6- (B(P*))? (B(Q'))? . 


Then, by applying induction to P* and Q* on the right-hand side we get: 
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1 


E(X) < 9-1 (B(P2))? +6 (97-1 (B(P2))?)* (0 EA 





+9" (E(Q?))? 
= 97- (E(P?)}? +6.37- E(P?) -E(Q*) +9" (E(Q°))? 


< 9" (E(P?) + E(Q?))’. 
To analyze the right-hand side of the desired inequality, we use again the fact 
that P and Q are independent of £n. 
(E((Pen +@)"))° 
" (E(P?22) + 2E(2PQzrn) + E(Q?))” 
= 9" | E(P?) E(2?,) +2 E(PQ) E(an) + E(Q’) 
SS’ =x 
=1 = 
= 9" (E(P?) +E(Q?))’. 





This concludes the proof of the lemma. 











Having all the tools prepared, we are ready to show the main lemma of 
this section. 


Lemma 9.19. If |S| > 4-9" - 4" - k?, then there exists an assignment w : 
{@1,...,%n} > {-1,1} such that X (p) > 27k’. 


Proof. Since k’ > 0, we have S 4 0 and X is not a zero polynomial. It follows 
that E(X*) > 0. Having Lemma 9.18] we can use Corollary with b = 9", 


obtaining 
oO Oo 
Pr(X > )=P:(x2 7) oo, 
2-3" 2Vb 


However, Lemma implies that o? > |S|, whereas |S| > 4-9" - 4" - k’?. 


Therefore, 
2-3" -2"-k! 
Pr| X > ——___—_ ] > 0. 
(x2 aa) 





We conclude that there always exists an assignment 7 such that X(wW) > 
27. k. 














Theorem 9.20. The problem MAx-Er-SAT is fixed parameter tractable 
when parameterized by k! = k — m(1 — 27"), and can be solved in time 
O(m + 20%")), 


Proof. Recall that we treat r as a constant. As mentioned before, in O(m) 
time we compute the monomial representation of the polynomial X. If X 
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has at least 4-9" - 4" - k’? monomials, then, by Lemmas and we 
are working with a yes-instance, and we may provide a positive answer. If, 
however, X does not have that many monomials, then only O(k’”) variables 
appear in the simplified form of X and we can exhaustively check all the 
20(k°?) assignments. 














Lemma [p.19] implies that either X has many monomials and we can safely 
provide a positive answer to the problem (by Lemma [9-12}, or the number 
of monomials in X is bounded by O(k’?). Even though we have only used 
polynomial time, we have not yet obtained a kernel for the following two 
reasons. 

First, we have to bound the total instance size, not only the number of 
monomials. The encoding of the instance needs to include also the coeffi- 
cients standing in front of these monomials, and they can be a priori as large 
as m, the number of clauses. However, it is easy to verify that they will 
never be larger than m, which means that their binary representations are of 
length O(log m). To compress the total size of the encoding of the instance to 
polynomial in k’, we can apply the following standard trick. Providing that 
logm < k’, our instance is already of total size O(k’*) and we are done. 
However, if logm > k’?, then m > 2k” and we can solve our instance in poly- 
nomial time using Theorem In this manner, we can always compress 
the total bitsize, i.e., the number of bits in the binary representation, of the 
instance to O(k’*). 

Second, we have not obtained a kernel, because the polynomial X is not 
an instance of the MAx-Er-SAT problem we were working with initially. 
Such a shrinking algorithm where the target problem may be different from 
the source problem is called a polynomial compression; this notion will be 
formally stated in Definition Nevertheless, having a polynomial com- 
pression algorithm to a language which belongs to NP, like our problem with 
polynomial X, we can obtain also a standard polynomial kernel for the source 
problem: the only thing we need to do is apply an NP-hardness reduction that 
transforms the output of the compression back to an instance of MAx-Er- 
SAT. 


Theorem 9.21. The problem MAx-Er-SAT admits a polynomial kernel 
when parameterized by k' = k — m(1 — 27°). 


9.3 CONNECTED VERTEX COVER in planar graphs 


In Section [7.7|we have seen a simple example of how the technique of bidimen- 
sionality leads to subexponential parameterized algorithms in planar graphs. 
This powerful framework gives not only subexponential algorithms, but also 
efficient kernelization procedures and approximation schemes for many prob- 
lems. Moreover, the approach applies not only to planar graphs, but also to 
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more general graph classes, such as graphs with a fixed excluded minor. In this 
section, we present only the tip of the iceberg in the context of kernelization: 
We show a kernel with very few (at most 4k) vertices for the CONNECTED 
VERTEX COVER problem on planar graphs. 

Recall that a vertex cover of a graph G is a set of vertices X that hits all 
edges of G, that is, such that G\ X is an edgeless graph. A vertex cover X is a 
connected vertex cover if additionally GLX] is connected. In the CONNECTED 
VERTEX COVER problem, we look for a connected vertex cover of a given 
graph G whose size does not exceed the given budget k. We consider the 
parameterization by k. 


9.3.1 Plane graphs and Euler’s formula 


Recall that a planar graph is a graph that can be embedded into the plane 
or, equivalently, into the sphere, whereas a plane graph is a planar graph 
together with its one fixed embedding into the plane (or sphere). We refer 
the reader to Section [6.3] and to the appendix for precise definitions. 

Our starting point is Euler’s well-known formula. 


Theorem 9.22 (Euler’s formula). In any connected nonempty plane graph 
G, the number of faces is exactly 


|E(@)| - |V(G)| +2. 


We remark here that Euler’s formula remains true if we allow G to be a 
multigraph. 
A simple well-known corollary of Euler’s formula is the following lemma. 


Lemma 9.23. If G is a nonempty simple planar bipartite graph whose every 
connected component contains at least three vertices, then |E(G)| < 2|V(G)|— 
4. 


Proof. First, observe that we may consider each connected component of G 
separately. Hence, for the rest of the proof we additionally assume that G is 
connected and, by the assumptions of the lemma, |V(G)| > 3. 

Second, fix a planar embedding of G, and let f be the number of faces 
in this embedding. Since G is connected, every face in this embedding is 
homeomorphic to an open disc. For a face F, by the length of F we mean the 
length of the closed walk encircling F; note that this walk is not necessarily 
a simple cycle, as some cutvertices of G or bridges in G may be traversed 
more than once. 

Since |V(G)| > 3 and G is simple, each face of G is of length at least 3. 
On the other hand, since G is bipartite, each face is of even length. Hence, 
each face is of length at least 4. Observe that if we sum up the lengths of all 
faces we obtain twice the number of edges of the graph G. We infer 
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4f < 2|E(G)|. (9.5) 
By plugging (9.5) into Euler’s formula we obtain 


|E(G)| — |V(G)| + 2 < |E(G)|/2, 


which is equivalent to the claim of the lemma. 














9.3.2 A lemma on planar bipartite graphs 


Our main tool in this section is the following simple lemma. 


Lemma 9.24 (Planar neighborhood lemma). Let G be a simple planar 
bipartite graph with bipartition classes A and B. Moreover, assume that each 
vertex in B is of degree at least 3. Then |B| < 2|Al. 


Proof. First, observe that we can assume G does not contain isolated vertices: 
due to the degree assumption on the side B, such vertices may appear only 
on the side A and only contribute to the right-hand side of the inequality in 
question. Second, note that the lemma is trivial if G is empty. 

If G is nonempty and does not contain isolated vertices, then the degree 
assumption implies that every connected component of G has at least four 
vertices. Hence, the graph G satisfies the assumptions of Lemma and 
we obtain that |E(G)| < 2|V(G)| — 4. On the other hand, note that, due to 
the degree bound, it holds that |E(G)| > 3|B]. Hence, 3|B| < 2|V (G)| — 4 = 
2|A| + 2|B| — 4, and the lemma follows. 














Lemma although a simple corollary of Euler’s formula, turns out to 
be extremely useful in the context of kernelization in planar graphs, through 
the following application. 


Corollary 9.25. Let G be a planar graph and A C V(G) be an arbitrary set 
of vertices. Then there are at most 2|A| connected components of G \ A that 
are adjacent to more than two vertices of A. 


Proof. Define B to be the family of connected components of G \ A, and 
create an auxiliary bipartite graph H with bipartite classes A and 6 where 
a vertex a € A is adjacent to a component C € B if and only if there exists 
an edge between a and some vertex of C in G. In other words, to obtain H, 
we start with the graph G, contract all connected components of G \ A into 
single vertices, suppressing multiple edges, and delete all edges of G[A]. In 
particular, H is a planar simple bipartite graph. Lemma |9.24] implies that 
at most 2|A| elements of 6 are adjacent to more than two vertices of A. In 
the language of the graph G, this is exactly the desired conclusion of the 
lemma. 
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A typical usage of Corollary [9.25] in the context of kernelization is as 
follows: In the studied graph G, we have recognized some set A of 
small size (for example, the hypothetical solution or its approxima- 
tion). Corollary [9.25] immediately provides a bound on the number of 
connected components of G \ A that have more than two neighbors in 
A; think of such components as the ones with a complicated neighbor- 
hood in A. All the other components have at most two neighbors in A; 
such components have simple neighborhood in A. There is hope that 
some problem-specific reduction rules may be able to exploit the sim- 
plicity of this structure, and reduce the (a priori unbounded) number 
of components of the latter type. 


9.3.3 The case of CONNECTED VERTEX COVER 


We shall now see how Corollary [9.25] works in the simple example of CoN- 
NECTED VERTEX COVER. Let (G,k) be an input and assume for a moment 
that (G,k) is a yes-instance. Let X be a solution: a vertex cover of G such 
that G[X] is connected and |X| < k. 

Apply Corollary [9.25]to the set X in the graph G. The assumption that 
X is a vertex cover greatly simplifies the picture: each connected component 
of G\ X is actually a single vertex. Thus, by Corollary [9.25]there are at most 
2|X| < 2k vertices of G\ X of degree at least 3. In other words, we have 
obtained, almost for free, a bound on the number of vertices of degree at 
least 3 in the graph: at most k of them reside in X, and at most 2k outside 
X. The vertices of smaller degree will turn out to be simple enough to allow 
us to reduce them. 

We first deal with isolated vertices. Clearly, they are irrelevant to the 
problem. 


Reduction CVC.1. Delete all isolated vertices. 


Moreover, note that the requirement that the solution needs to be connected 
gives us the following rule. 


Reduction CVC.2. If G is not connected, then conclude that we are dealing 
with a no-instance. 


For the sake of formal correctness, we will need also the following trivial re- 
duction rule; it will allow us to avoid some border cases in the argumentation. 


Reduction CVC.3. If k < 0 or |V(G)| < 3, then terminate the algorithm 
and output an answer. 
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We now focus on vertices of degree 1. Let v be a vertex of degree 1 in G, and 
let u be its sole neighbor. As |V(G)| > 3 and G is connected, any connected 
vertex cover of G needs to include u: it needs to include u or v to hit the 
edge uv, but the only way to include v and remain connected is to include u 
as well. It is now useful to think of the vertex v as an “annotation”: the only 
role of v in the instance is to force the solution to take vertex u. From this 
perspective, it clearly does not make sense to have multiple annotations at 
one vertex, i.e., multiple degree-one vertices attached to the same neighbor, 
since they play exactly the same role in the instance. This intuitively justifies 
the safeness of the following reduction; the formal proof is left to the reader 
as Exercise 


Reduction CVC.4. If a vertex u has multiple neighbors of degree 1, then 
delete all but one of them. 


We now move to vertices of degree 2. Let v be such a vertex, and let u and 
uz be its neighbors. There are two cases to consider, depending on whether 
v is a cutvertex in G or not. 

Let us first focus on the case when v is a cutvertex. We claim that any 
connected vertex cover in G needs to contain v. Indeed, if a vertex cover X 
of G does not contain v, it needs to contain both u and ug. However, as v 
is a cutvertex, the only way to make G[X] connected is to include v into X 
as well. 

Consider now a graph G’, where a vertex v with its incident edges is 
replaced with an edge u1u2; in other words, G” is constructed from G by 
contracting the edge vu, and keeping the new vertex under the name u1. 

Take any connected vertex cover X of G. By the previous argumentation, 
we have that v € X. Moreover, since |V(G)| > 3 and G is connected, X 
cannot consist of only v and hence either u; or uz belongs to G. It now 
follows that X \ {v} is a connected vertex cover of G” of size |X| — 1. 

Let now X’ be any connected vertex cover of G”. Then clearly X’ U {v} 
is a vertex cover of G of size |X’| + 1. Moreover, as uyuz € E(G’), we have 
uy E€ X’ or uz € X’, and hence G[X] is connected. 

Concluding, we have proved that connected vertex covers of G and of G” 
are in one-to-one correspondence relying on exclusion/inclusion of vertex v. 
This claim allows us to formulate the following reduction rule. 


Reduction CVC.5. If there exists a cutvertex v of degree 2 in G, then 
contract one of its incident edges and decrease k by 1. 


We are left with vertices of degree 2 that are not cutvertices. The following 
observation is now crucial. 


Lemma 9.26. Letv be a vertex of degree 2 in G that is not a cutvertex. Then 
there exists a connected vertex cover of G of minimum possible size that does 
not contain v, but contains both neighbors of v. 
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Proof. Let X be a connected vertex cover of G of minimum possible size. If 
v € X, we have Ng(v) C X and we are done, so assume otherwise. Let u1 
and uz be the two neighbors of v in G. As v is not a cutvertex, there exists a 
cycle C in G that passes through u1, v and ug. In particular, u; and uz are 
of degree at least 2. 

As C contains at least three vertices, |X| > 2 and, by the connectivity of 
X, one of the neighbors of v needs to be contained in X. Without loss of 
generality assume that u € X. 

If uz € X, then consider X’ = (X \ {v}) U {u2}. Clearly, X’ is a vertex 
cover of G as well. Moreover, as uz ¢ X, X needs to contain both neighbors 
of uz on the cycle C, and, consequently, G[X"] is connected. 

Assume then that uz € X. Clearly, X \ {v} is a vertex cover of G, but may 
not be connected: G[X \ {v}] may consist of two connected components, X1 
and Xə, where u; € X; and ug € X2. We now focus on the cycle C. As X is 
a vertex cover of G, there exist three consecutive vertices x1, Y, £2 on C such 
that xı € X1, £2 E€ Xə and y ¢ X. However, in this case (X \ {v}) U{y} isa 
connected vertex cover of G of size |X|, and we are done. 


Lemma{9.26]allows us again to treat a vertex of degree 2 that is not a cutver- 
tex as an “annotation” to its neighbors: the only role it plays is to annotate 
that these two neighbors can be safely included into the solution. Recall that 
vertices of degree 1 annotate their neighbors in a similar manner, so we should 
be able to replace the type of annotation (see also Fig. (9.5). 


Reduction CVC.6. If there exists a vertex v of degree 2 that is not a cut-ver- 
tex, then delete v and add a neighbor of degree 1 to each former neighbor of 
v. 


Again, the formal verification of safeness of Reduction [CVC.6]is left to the 
reader as Exercise 

We emphasize that Lemma [9.26] proves safeness of Reduction [CVC.6] only 
in the case when it is applied to only one vertex at a time. That is, it is 
possible that there does not exist a single connected vertex cover of minimum 
possible size in G that contains at once neighbors of all vertices of degree 2. 
In particular, consider a case when one of the neighbors of v, say u1, is of 
degree 2 as well. Then an application of Reduction [CVC6]to v turns uz into 
a cutvertex (separating the newly created neighbor of degree 1 from the rest 
of G) and Reduction [CVC.5] triggers on u. 

Observe that an exhaustive application of Reductions 
and completely removes vertices of degree 0 and 2, while Reduc- 


tion limits the number of vertices of degree 1. We may now conclude 
with the following. 


Lemma 9.27. If (G,k) is a yes-instance and no reduction rule is applicable 


to G, then |V(G)| < 4k. 


Proof. Let X be a connected vertex cover of G of size at most k. Consider 
the vertices of Y = V (G) \ X and recall that Y is an independent set in G. 
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Fig. 9.5: A situation where Reduction |CVC.6]is applicable and the outcome 
of its application 


As discussed before, Corollary [9.25] implies that at most 2|X| vertices of Y 
have degree greater than 2 in G. Moreover, no vertex of G may have degree 0 
or 2, while each vertex in X may have at most one neighbor in Y of degree 1. 
Consequently, |Y| < 2|X| + |X| < 3k and |V(G)| < 4k. 














Lemma allows us to claim the final reduction rule that explicitly bounds 
the size of the kernel. We remark that it always triggers when k becomes 
negative in the process of reducing the graph. 


Reduction CVC.’. If G has more than 4k vertices, then conclude that we 
are dealing with a no-instance. 


Finally, we should also remark that all reduction rules are trivially appli- 
cable in polynomial time and all somehow simplify the graph: each reduction 
rule either terminates the algorithm, or decreases the number of edges of the 
graph, or maintains the number of edges of the graph while decreasing the 
number of vertices of degree different than 1. 


9.4 Turing kernelization 


As we will see in Chapter [15] for some problems it is impossible to construct 
polynomial kernels (under widely believed complexity assumptions). How- 
ever, imagine a setting where instead of creating a single small instance, our 
algorithm would be allowed to create a set of small instances together with 
a recipe on how to compute the answer for the original instance, given the 
answers for all the generated small instances. One can go even further, and 
define an adaptive version of such a routine, called Turing kernelization. 
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Definition 9.28 (Turing kernelization). Let Q be a parameterized prob- 
lem and let f : N — N be a computable function. A Turing kernelization for Q 
of size f is an algorithm that decides whether a given instance (x, k) € X* xN 
is contained in Q in time polynomial in |x|+ k, when given access to an oracle 
that decides membership in Q for any instance (x’, k’) with |zx'|, k’ < f(k) in 
a single step. 


As for the classic definition of a kernel, a Turing kernel is polynomial if 
f is a polynomial function. Obviously, regular kernels are Turing kernels as 
well, as it is enough to preprocess an instance in polynomial time and then 
use a single oracle call. Note, however, that Turing kernelization is much less 
restrictive than regular kernelization, as the total size of instances passed to 
the oracle may be even significantly larger than the input size. Furthermore, 
a Turing kernel can be adaptive, that is, its next steps and further queries 
can depend on the answer obtained from the oracle in the past. 

The motivation of Turing kernelization stems from the practical viewpoint 
on the concept of data preprocessing. From a practitioner’s viewpoint, the 
notion of kernelization is supposed to provide a formal framework for design- 
ing preprocessing procedures that lead to instances that are “crackable” by a 
brute-force algorithm. At the end of the day, the practitioner’s goal is usually 
to solve the problem, and it is not that important whether the brute-force 
procedure will be run on a single instance obtained at the end of the prepro- 
cessing phase, or rather that the whole process of solving the task will require 
tackling several small instances along the way, reflecting different subcases. 
Thus, Turing kernelization expresses the practical concept of preprocessing in 
a better way than the classic definition of kernelization, introduced in Chap- 
ter [2| Unfortunately, this comes at a cost of having much weaker theoretical 
tools for analyzing this relaxed notion of a kernel. 

In this section, we present an example of a problem that separates the 
classes of parameterized problems admitting polynomial kernels and admit- 
ting polynomial Turing kernels. In the MAX LEAF SUBTREE problem, we are 
given a graph G together with an integer k and the question is whether there 
is a subtree T of G with at least k leaves. Recall that Exercise (6.15]asked you 
to show that MAX LEAF SUBTREE admits a nonuniform FPT algorithm. 

Observe that if G has several connected components, then T has to reside 
in a single connected component of G, as it is connected. The MAX LEAF 
SUBTREE problem does not admit a polynomial kernel under widely believed 
complexity assumptions (see Exercise[15.4| point [2). However, intuitively, its 
kernelization hardness is caused by the fact that given a graph with several 
connected components, it is hard to identify those components which are 
unlikely to contain a solution. As a matter of fact, the situation changes 
drastically if we assume the given graph is connected. 
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9.4.1 A polynomial Turing kernel for MAX LEAF 
SUBTREE 


Our first step is to show that, with an additional assumption that the given 
graph is connected, MAX LEAF SUBTREE admits a polynomial kernel in the 
classic sense. Observe that if a solution T for an instance (G,k) exists in a 
connected graph G, then there also exists a solution which is a spanning tree 
of G. This is because extending a subtree of G to a spanning tree cannot 
reduce the number of its leaves. 

One can show that subdividing an edge of a graph that connects two 
vertices of degree 2 does not change the maximum number of leaves over all 
spanning trees of a graph. The following reduction rule is the reversal of such 
an operation, and the proof of its safeness is left to the reader as Exercise[9.8| 


Reduction MLS.1. If there is a vertex v of degree 2 where the neighbors 
u1, U2 Of v are nonadjacent and also have degrees equal to 2, then delete v 
and add an edge u1u2. 


In the main lemma of this section, we want to show that if G does not con- 
tain long paths with all inner vertices of degree 2 in G (i.e., when Rule[MLS.1] 
is not applicable) and G has appropriately many vertices, then (G,k) is def- 
initely a yes-instance. 


Lemma 9.29. If Reduction |MLS.1| is inapplicable and |V(G)| > 6k?, then 


(G,k) is a yes-instance. 


Proof. For the sake of contradiction, assume that |V(G)| > 6k? and (G,k) 
is a no-instance. Let us first select greedily a set S of vertices at distance 
at least 3 from each other (i.e., have disjoint closed neighborhoods), in the 
following way. Recall that for a positive integer d and vertex set S C V(G), 
we denote by N“@[S] the set of vertices of G within distance at most d from 
S. Consider the following procedure. Initially, set S = @ and, as long as 
V(G) \ N?[S] contains a vertex of degree at least 3 (in G), add to S a vertex 
of V (G) \ N?[S] that has the largest degree (in G). Let r = |S] and v1,..., Ur 
be the sequence of vertices added to S$ in this order. Denote d; = dega(v;). 


Claim 9.30. If >), <;<,(di — 2) > k, then (G,k) is a yes-instance. 


Proof. Note that, by construction, the vertices in the set S have pairwise 
disjoint neighborhoods. Consequently, we can construct a forest F consisting 
of r stars centered at vertices of S having exactly ieee d; leaves. As G 
is connected, we can transform F into a tree in a step-by-step manner, each 
time adding to F a path between two of its connected components, losing at 
most two leaves at a time. We need to add at most r—1 paths to connect the 
r stars, hence we lose at most 2(r—1) < 2r leaves. Therefore, )7, <;<,.(di — 2) 
is a lower bound on the number of leaves of the resulting tree. 4 
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In particular, if r > k then we are done, because all the vertices in S are of 
degree at least 3 and by Claim (G, k) is a yes-instance. In the following, 
by distg(u,v) we denote the distance between u and v in G. 


Claim 9.31. If, for some v € V(G) and d > 1, we have |{u € V(G) 
distg(u, v) = d}| > k, then (G, k) is a yes-instance. 


Proof. It is enough to construct any tree spanning all the vertices at distance 
at most d—1 from v and attach vertices at distance d as leaves. J 


Since G is connected, we can conclude that each connected component 
of G \ N?[S] contains at least one vertex at distance exactly 3 from some 
vertex of S. In the next claim, we give an upper bound on the number of 
such vertices. 


Claim 9.32. For any positive integer d, if there are more than r(k — 1) ver- 
tices u € V(G) that are at distance exactly d from S (i.e., diste(u, S) = d), 
then (G,k) is a yes-instance. 


Proof. As |S| = r, by the pigeonhole principle there exists v € S such that at 
least k vertices of G are at distance d from v. By Claim|9.31|for this particular 
vertex v, (G, k) is a yes-instance. 


By Claim |9.32/for d = 3, there are at most r(k — 1) < k? connected compo- 
nents of G \ N?[S]. Hence, to bound the number of vertices of V(G) \ N?[S], 
it is enough to bound the number of vertices of a single connected component 
of G \ N?[S]. 


Claim 9.33. Each connected component of G\ N?(S] consists of at most four 
vertices. 


Proof. Denote H = G \ N?[S]. As all the vertices of H are of degree at most 
2, the graph H consists of paths and cycles only. If H contained a connected 
component of size at least 5, then there would be a vertex v of degree 2 in 
H, such that both its neighbors would be of degree 2 in H as well. However, 
no vertex of V(H) = V(G) \ N?[S] is of degree more than 2 in G, hence 
Rule [MLS.1| would be applicable, a contradiction. 4 


Consequently, V(G) \ N?[S] contains at most 4k? vertices. By Claim [p.32] 
for d= 1 and d = 2, there are at most 2r(k — 1) vertices that are at distance 
1 or at distance 2 from S. This implies |N?[S]| < rt+r(k—1)+r(k—1) < 2k?. 
Therefore, G contains less than 6k? vertices in total, a contradiction. This 
finishes the proof of Lemma 














By Lemma we infer the following reduction rule. 


Reduction MLS.2. If |V(G)| > 6k?, then conclude that (G,k) is a yes- 
instance. 
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Consequently we have the following theorem and a Turing kernelization 
as a corollary. 


Theorem 9.34. The MAX LEAF SUBTREE problem in connected graphs ad- 
mits a kernel with less than 6k? vertices. 


Corollary 9.35. The MAX LEAF SUBTREE problem admits a polynomial 
Turing kernelization. 


Proof. Consider an instance (G, k) of MAX LEAF SUBTREE and let C4, ..., Cp 
be the connected components of G. We invoke the kernelization algorithm 
from Theorem on r instances (C4, k), where the graph is obviously con- 
nected, producing an equivalent instance (C{,k’), where both V(C%) and k’ 
are bounded by O(k?). Finally the Turing kernelization routine calls an oracle 
that determines the answer for each instance (C/, k’). If any of these answers 
is positive, then we can return yes, and otherwise it is safe to return no. 














Let us repeat that Corollary stands in a counterposition to Exer- 
cise [15.42] While Max LEAF SuBTREE admits a polynomial Turing kernel, 
the existence of a classic polynomial kernel can be refuted under standard 
complexity assumptions. 


Exercises 


9.1 (Æ). Consider the following weighted version of the task of finding the maximum 
number of vertex-disjoint T-paths in a graph G: each edge of G is additionally equipped 
with a nonnegative weight, and we look for a family of pairwise vertex-disjoint T-paths 
that has the maximum possible cardinality and, among those of the maximum possible 
cardinality, has the minimum possible total weight. Prove that this problem is polynomial- 
time solvable. 


9.2 (8). Consider the following weighted variant of FEEDBACK VERTEX SET: each vertex 
of the graph is additionally equipped with a nonnegative weight, and we look for a feedback 
vertex set that has at the same time at most k vertices and total weight at most W, where 
both k and W are given as an input. Adjust the argumentation of Section [9.1] to give a 
kernel for this variant with the number of vertices bounded polynomially in k. 


9.3. In the MAXIMUM SATISFIABILITY problem, we are given a formula » with n variables 
and m clauses, together with an integer k, and we are to decide whether there exists an 
assignment satisfying at least k clauses. Show that this problem is FPT when parameterized 
by k’ = k — m/2. 


9.4 (&). Prove formally that Reduction|CVC.4lis safe. In particular, show that the min- 
imum possible size of a connected vertex cover before and after its application are equal. 


9.5 (2). Using Lemma prove formally that Reduction |CVC.4lis safe. 


9.6 (£). In the Eocr DOMINATING SET problem, we look for a set of at most k edges in 
the given graph G whose endpoints form a vertex cover of G. Show that EDGE DOMINATING 
SET in planar graphs admits a kernel with O(k) vertices. 
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9.7 (£8). Consider the following problem: given a planar graph G and integers k and d, 
decide if there exists a set X of at most k vertices of G such that each connected component 
of G\X has at most d vertices. Prove that this problem admits a kernel with O(kd) vertices. 


9.8. Prove that Reduction|MLS. lis safe. In particular, show that there is a subtree with 
at least k leaves in the graph before the reduction if and only if there is a subtree with at 
least k leaves in the graph after the reduction. 


9.9. Prove that one can strengthen Claim[9.33] and show that each connected component 
of G \ N?[S] consists of at most three vertices, leading to a slightly improved 5k? upper 
bound on the number of vertices in the kernel. 


9.10 (&). Improve the reasoning of Lemma 19.29] to obtain an O(k) bound on the number 
of vertices of G. 


Hints 


Recall that finding a matching that has minimum possible weight among matchings 
of maximum possible cardinality is polynomial-time solvable. 


First, observe that the argumentation of Section[9.1|breaks in the weighted setting at 
Lemma 9.8 as the vertices of | D may have much smaller weights than those of Z. You 
need to strengthen this lemma to force any feedback vertex set of size at most k to contain 
either v or Z. The main trick to obtain it is to replace the use of q-expansion lemma for 
q = 2 with a use for q = k + 2. The cost of this change is a cubic instead of quadratic 
bound on the number of vertices of the final graph. 


Show that there exists a constant c > 1/2, such for that each CNF formula y which 
does not contain at the same time a single literal clause x and its negation ~z, there exists 
an assignment satisfying at least cm clauses. 


Proceed as with CONNECTED VERTEX CovFR in Section Use Corollary [9.25 
with the set A being the endpoints of the hypothetical solution to bound the number of 
vertices of degree more than 2. Vertices of degree 0 and 1 can be treated very similarly to 
the approach in Section [9.3] A bit more work is required with vertices of degree 2. First, 
show how to reduce many such vertices with common neighborhoods. Second, show that 
planarity implies that there are only O(k) possible pairs of vertices of a solution that share 
a common neighbor of degree 2. 


Again, proceed as in Section[9.3]and apply Corollary[9.25]to the hypothetical solution 
X. Although the connected components of G \ X are not isolated vertices, they have size 
bounded by d and you need to care only about the components that neighbor one or two 
vertices in X. 


Note that a connected component of H which is a path of four vertices needs to have 
at least one of its endpoints of degree two in G. 


[9.10] Improve Claim [9.32] to get an O(r) bound on the number of vertices at distance d 
from G. To achieve this, combine the reasonings of Claims[9.30] and [9.31] 
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Chapter 10 


Algebraic techniques: sieves, 
convolutions, and polynomials 


In this chapter we discuss selected techniques that have 
some “algebraic flavor,” i.e., they use manipulation of 
algebraic expression, rather than analyzing the combi- 
natorial properties of objects. Another common theme 
in these techniques is “sieving”: we sieve out some un- 
wanted objects by means of algebraic cancellation. 





Essentially all the techniques we cover in this chapter are older than param- 
eterized complexity and have been developed in different contexts. We begin 
in Section with the classical inclusion—exclusion principle and present 
its simple (but non-obvious!) applications to problems like HAMILTONIAN 
CYCLE or CHROMATIC NUMBER, where the only parameter involved is the 
number of vertices, and a slightly more technical application to STEINER 
TREE parameterized by the number of terminals. 

Next, in Section [10.2] we show that the algorithms based on the inclusion- 
exclusion principle can be reinterpreted using two transforms of functions on 
the subset lattice, i.e., functions of the form f: 2” — Z. These transforms, 
called the zeta transform and the Möbius transform, play also a similar role 
as the Fourier transform and the inverse Fourier transform in the harmonic 
analysis of functions, e.g., they can be used for fast computation of convolu- 
tions of functions on the subset lattice. This is shown in Section with 
an application to CHROMATIC NUMBER. In Chapter [M] we will see more ap- 
plications of this technique in accelerating dynamic-programming algorithms 
for bounded treewidth graphs. 

Finally, in Section [10.4] we present algorithms based on evaluating multi- 
variate polynomials. To use this approach for a parameterized problem, one 
designs a polynomial p that 


e can be evaluated in FPT time, and 
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e encodes all the solutions, in particular p is nonzero if and only if there is 
at least one solution. 


Then it turns out that there is a simple randomized algorithm for testing 
whether p is nonzero based on the Schwartz-Zippel lemma. We illustrate this 
scheme by algorithms for LONGEST PATH. In the resulting algorithms, one 
can observe an algebraic reformulation of the color-coding approach from 
Section 

Throughout this chapter, unless otherwise specified, n and m denote the 
number of vertices and edges of the input graph. We will often use two differ- 
ent bracket notations in this chapter. The first is the Iverson bracket notation, 
which is 1 if the condition in square brackets is satisfied, and 0 otherwise. For 
example, for integers a, b,c, d, the value of the expression [a < b,c < d] is 1 if 
a < b and c < d both hold, and 0 otherwise. The second usage of brackets is 
to denote for an integer k the set [k] = {1,..., k}. 

We will often use the binomial theorem, which states that )>;"_, (")a’b"-? = 
(a +b)”. Note that if a = —b, then this expression is 0 for every n > 0, but 
it is 1 for n = 0. 


10.1 Inclusion—exclusion principle 


In this section, we present applications of the inclusion—exclusion principle, a 
well-known tool in combinatorics. Usually, the inclusion—exclusion principle 
is described as a formula for computing |(j_, Ai| for a collection of sets 
Aj,...,An. For two sets A; and Ag, the size |A; U Ag| of the union can be 
less than the sum |A| + |Ag| of the sizes: the expression |A;| + |A2| counts 
the size of the intersection A; N A2 twice, hence we have to subtract it to 
avoid double counting: 


|A U Aa| = |Aı| + |42| — | A1 N Aol. 
For three sets A1, Az, A3, we have the formula 
|A UA2UA3] = |A |+| A2|+|43|-|41N42|-|A41N43|-| 42N A3|+|4A1NA2NA;3]. 


That is, after subtracting the size of the pairwise intersections, the common 
intersection A;7A2NA3 was considered three times positively and three times 
negatively, thus we have to add it once more. Below we state the formula for 
the general case. 


Theorem 10.1 (Inclusion—exclusion principle, union version). Let 
Aj,...,An be finite sets. Then 


|U 4l= S> Cyt. A (10.1) 


ie[n OAX C[n] iEX 
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Proof. Consider any element e € Vien A;. Let k > 1 be the number of 
sets A; containing e and let A;,,...,A;, be these sets. Note that for X 4 Q 
we have e € ({),- A; if and only if X C {i1,...,i,}. Hence the amount e 
contributes to the right-hand side of equals 


k 
Do cpt =cny ey =e- 


OAXCLin,...,in} 

















as required. 


Intuitively, Theorem [10.1|allows us to compute the size of the union if for 
some reason the sizes of all the intersections are easy to compute. However, 
for our applications, we need the dual form of this statement that allows an 
easy computation of the size of the intersection if the sizes of all the unions are 
easy to compute. We state the dual form below. It can be easily derived from 


Theorem using De Morgan’s laws so we skip the proof here; however, in 
Section we prove a generalized, weighted inclusion—exclusion principle. 


Theorem 10.2 (Inclusion—exclusion principle, intersection version). 


Let Aı,..., An CU, where U is a finite set. Denote (\,<g(U \ Ai) =U. Then 


LM) Al = X D] NUA]. 


i€[n] XC[n] icx 


In a typical algorithmic application of the inclusion-exclusion formula 
we need to count some objects that belong to a universe U, and it is in 
some sense hard. More precisely, we are interested in objects that satisfy 
n requirements A;,..., An and each requirement is defined as the set of 
objects that satisfy it. Thus, the inclusion—exclusion formula translates 
the problem of computing [Niel] A;| into computing 2” terms of the 
form |(),<x(U \ A;)|. In our applications, computing these terms is in 
some sense easy. If, for example, each of them is computable in poly- 
nomial time, then the inclusion—exclusion formula gives an algorithm 
which performs 2"n°) arithmetic operations. 


10.1.1 HAMILTONIAN CYCLE 


The idea above applies smoothly to the well-known HAMILTONIAN CYCLE 
problem: given a directed graph G, we want to decide whether G contains 
a cycle which contains all the vertices of G, called the Hamiltonian cycle. 
In fact, we will solve a more general problem — we will count the number 
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of Hamiltonian cycles. Recall that Exercise |6.2| asked you for a 2°n°-time 
algorithm using dynamic programming. In this section we show that using 
the inclusion—exclusion formula we can get an algorithm running in roughly 
the same time, but using only polynomial space. 

A walk vo,..., Uz is called closed when vo = vz. We also say that the length 
of this walk is k, i.e., the number of its edges. 

Let vo be an arbitrary vertex from V(G) and let the universe U be the 
set of all closed walks of length n of the form vp,v1,...,Un = Vo. For every 
vertex v € V(G), define a requirement A, C U as the set of all closed walks 
of length n that start at vo and visit v at least once. Clearly, our goal is 
to compute |(),<y(q) Av|: a closed walk of length n can visit all vertices at 
least once only if it is a cycle of length exactly n. By the inclusion—exclusion 
principle (Theorem [10.2) this reduces to computing, for every X C V(G), the 
value of |(),cx(U \ Av)|, i.e., the number of closed walks of length n from vo 
in graph G” := G — X. This is easily computable in polynomial time: if M is 
the adjacency matrix of G”, then we compute the n-th power M” and return 
the entry from the diagonal of M” that corresponds to vo (see Exercise[10.2}. 
Therefore, we can compute in polynomial time each of the 2” terms of the 
inclusion-exclusion formula and hence we can compute | (),,<y(g) Aul- 

There is one more thing to check. So far we just showed that the number 
of Hamiltonian cycles can be found using 2”n°) arithmetic operations. The 
question is how costly is an arithmetic operation? Every number that appears 
in the algorithm is bounded by the number of walks of length at most n from 
some vertex, which is O(n”). This means it takes O(nlogn) bits and each 
arithmetic operation takes polynomial time. An important property of the 
algorithm sketched above is that it takes only O(n7), i.e., polynomial, space. 


Theorem 10.3. The number of Hamiltonian cycles in a directed n-vertex 
graph can be computed in time 2°nC and polynomial space. 


10.1.2 STEINER TREE 


In the unweighted STEINER TREE problem, we are given an undirected graph 
G, a set of vertices K C V(G), called terminals, and a number £ € N. The goal 
is to determine whether there is a tree H C G (called the Steiner tree) with 
at most £ edges that connects all the terminals. Recall that in Section 
we presented a 3/KlnO(-time algorithm for STEINER TREE using dynamic 
programming (in fact, that algorithm works also for the weighted version, 
where we have arbitrary weights on the edges). Here we show a 2IklnO(- 
time algorithm for (the unweighted version of) this problem parameterized 
by |K|. Without loss of generality, we assume |K| > 2. 

The idea is to apply the inclusion—exclusion-based algorithm similar to 
the one we use for HAMILTONIAN CYCLE. There, we observed that although 
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Fig. 10.1: An ordered rooted tree H and a branching walk (H,h) in a graph 


counting paths or cycles is hard, counting walks is easy. Here, instead of count- 
ing trees, we will count branching walks. Recall that in an ordered rooted tree 
the children of every node have a fixed order. This assumption is important 
because the algorithm we present here counts certain branching walks and 
their number depends on whether the corresponding tree is ordered or not. 
A branching walk is a relaxed notion of a tree, or more precisely a ho- 
momorphic image of a tree. Formally, a branching walk in graph G is a pair 
B = (H,h) where H is an ordered rooted tree and h: V(H) > V(G) is 
a homomorphism, i.e., if zy € E(H) then h(x)h(y) € E(G). We say that 
branching walk B is from s when h(r) = s, where r is the root of H. We de- 
note by V(B) = h(V(#)) the set of vertices in G where h maps the vertices of 
H. Observe that h(V(H)) induces a connected graph in G. The length of the 
branching walk B is defined as |E(H)|. Of course, every walk is a branching 
walk (with tree H being a path). Also, every tree H in G corresponds to a 
branching walk (H,h) with h being the identity function. One can see that 
the image h(H) = {h(u)h(v) : uv € E(H)} is a tree if and only if h is injec- 
tive. Figure [10.1]shows an example of a branching walk with non-injective h. 
The following observation provides a connection between the STEINER TREE 
problem and branching walks (the easy proof is left for the reader). 


Observation 10.4. Let s € K be any terminal. Graph G contains a tree H 
such that K C V(#H) and |E(H)| < £ if and only if G contains a branching 
walk B = (Hpg, h) from s in G such that K C V(B) and |E(Hp)| = £. 


Following Observation we will count branching walks of length £ 
from s that visit all the terminals and return true if and only if the resulting 
number is nonzero. Our universe U is now the set of all branching walks of 
length Z from s. For every v € K, we have the requirement A, = {BEU : 
v € V(B)}. Hence, our goal is to compute |N, eg Avl- By the inclusion- 
exclusion principle, this reduces to computing, for every X C K, the value 
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of |(,ex(U \ Av)|. Note here that sex (U \ Av) is the set of exactly those 
branching walks from U that avoid the vertices of X. 


Lemma 10.5. For every X C K, the value of |(,cx(U \ Av)| can be com- 
puted using O(mé?) arithmetic operations. 


Proof. Let G' = G — X. For a € V(G’) and j € {0,..., £}, let b;(a) be the 
number of length j branching walks from a in G”. Notice that if s ¢ V(G’) 
we should return 0, and otherwise our goal is to compute be(s). We compute 
b;(a) for all a € V(G") and j € {0,...,2} using dynamic programming with 
the following recursive formula: 


1 if 7 =0, 
bj(a) = 5 5 bj, (a)bja (t) otherwise. 


tENg (a) jı+j2=j—1 


In the formula above, for every neighbor t of a, we count the number of 
branching walks where t is the first child of a; for a fixed t we consider all 
possibilities for the size j2 of the subtree rooted at t (and jı is the number of 
remaining edges of the tree excluding the edge at as well). Note that this is the 
point where considering branching walks instead of trees pays off: multiplying 
bj, (a) and b,, (t) is justified, as we do not require the branching walks starting 
at a and t to be vertex disjoint; hence we may combine any two of them. We 
leave the formal proof of the correctness of the formula to the reader. The 
time bound follows, since for every j € {0,...,2} and every edge of G’, the 
inner sum is computed twice. 














Since the numbers in our computations have O(£ log n) bits (Exercise|10.4), 
arithmetic operations take polynomial time. We have just proved the follow- 
ing theorem. 


Theorem 10.6. The unweighted STEINER TREE problem can be solved in 
time 2IK|.n°™ and polynomial space. 


In the previous section we showed an algorithm for counting Hamiltonian 
cycles, which obviously also solves the corresponding decision problem. Here 
we solved unweighted STEINER TREE by counting branching walks. It is 
natural to ask whether one can count Steiner trees within similar running 


time. The answer is no, unless FPT = W[1] (see Exercise |10.5). 


10.1.3 CHROMATIC NUMBER 


A k-coloring of a graph G is a function c: V(G) > [k], such that c(u) 4 c(v) 
whenever u and v are adjacent. In the CHROMATIC NUMBER problem we are 
given a graph G with an integer k and we want to know whether G has a 
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k-coloring. Recall that Exercise [6.1] asked you for a 3"n°)-time algorithm, 
using dynamic programming. In this section we provide a faster algorithm 
based on the inclusion—exclusion principle. Clearly, each color class in k- 
coloring is an independent set and every subset of an independent set is also 
an independent set. This leads us to the following easy observation. 


Observation 10.7. G has a k-coloring if and only if there is a cover of V(G) 
by k independent sets, i.e., there are k independent sets I4, ..., J, such that 


By Observation it suffices to compute the number of covers of V(G) 
by independent sets. Our universe U is now the set of tuples (1,..., Ix), 
where J; are independent sets (not necessarily disjoint nor even different!). 
Clearly, we need to cover every vertex, i.e., for every v € V(G) we have the 


requirement 
TO EU: ve: i}. 


Then |(),,<v(qy Av| is the required number of covers of V (G) by independent 
sets. For Y C V(G), let s(Y) be the number of independent sets in G[Y], the 
subgraph of G induced by Y. By the inclusion—exclusion principle, finding 
|Nvev(c) Av| reduces to computing, for every X C V(G), the value of 


{ass JEU: h, CVG )\ X}] = sv GOX. 


Although we do not know how to compute s(V (G) \_X)* in polynomial time, 
s(Y) can be computed at the beginning simultaneously for all subsets Y C 
V (G) using a simple dynamic programming routine that employs the formula 


s(Y) = s(Y \{y}) + s(Y \ Ny), (10.2) 


where y is an arbitrary vertex in Y. This requires ce arithmetical oper- 
ations on O(n)-bit numbers, and takes space 2”n°\). Once we have s(Y) 
computed for every Y C v(Q), the value of s(V(G) x) can be found in 
polynomial time by O(log k) multiplications of O(nk)-bit numbers. We have 
just proved the following theorem. 


Theorem 10.8. The CHROMATIC NUMBER problem can be solved in time 
and space 2°n0), 


It is natural here to ask for a polynomial space algorithm. A 2”n°™ time 
polynomial-space algorithm is currently not known. Let us modify the algo- 
rithm from Theorem [10.8] at the cost of worse running time. When we aim 
at polynomial space we cannot afford to tabulate the number of independent 
sets s(Y) for every Y C V(G). If we just use the formula [10.2] in a recur- 
sive procedure, without storing the computed values, computing s(Y) takes 
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2I¥|nO® time for each subset Y C V(G) separately. By the binomial theorem 
the total time needed to compute | (),,cy(q) Av| using the inclusion—exclusion 
principle is then 


y glV(G)\X1 | ,OG) = (>: 4 ia) no) = 3770), 


XCV(G) k=0 


vEV 


However, there are better branching algorithms for finding the number of 
independent sets in a given n-vertex graph. In Exercise the reader is 
asked to show such an algorithm running in time 1.381"n°“). The currently 
best. known polynomial-space algorithm is due to Wahlström and runs 
in 1.238"n° time. By a similar application of the binomial theorem as 
above we get the following result. 


Theorem 10.9. The CHROMATIC NUMBER problem can be solved in time 
2.238"n° and polynomial space. 


Finally, let us note that there is a 2”n°(-time algorithm for counting 
k-colorings and we present one in Section {10.3.1 


10.2 Fast zeta and Möbius transforms 


In this section, we consider functions on the subset lattice, i.e., functions 
of the form f: 2” — Z, where V is some n-element set (in fact, Z can be 
replaced by any ring). It turns out that the inclusion—exclusion algorithms 
from Section [10-T]can be phrased in terms of transforms between functions on 
the subset lattice, i.e., functions that take as an argument a function f: 2” > 
Z and return another function g: 2” + Z. We use simplified notation, i.e., 
for a transform a and function f the function a(f) is denoted as af. The 
first is the zeta transform, defined as 


(Cf)(X) = SO FY). 


YCX 


Let us also define the Möbius transform, 


(uf (X) = SO CDY). 


YCX 
It will be convenient to have the following odd-negation transform, 
(of )(X) = CDX). 


We will frequently use compositions of transforms. The composition ao 8 
of two transforms a and £ is denoted shortly by ap. Recall that in the 
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composition notation a is applied after £, i.e., for every function f: 2Y > Z 
we have aff = a(6(f)). Clearly, oo is the identity transform id. It is also 
easy to see (Exercise |10.10) that the three transforms above are related as 
follows. 


Proposition 10.10. ¢ = opo and u = oo. 


As we will see, the following formula is a counterpart of the inclusion— 
exclusion principle. 


Theorem 10.11 (Inversion formula). We have uC = Cu = id, that is, 
F(X) = (ue f)(X) = (Cuf) 
for every X CV. 


Proof. We observe that: 


(uC f)(X) = (acoc f)(X) (Proposition [10.10) 

= (-1)/* 5 (o¢f)(Y) (expanding c and ¢) 
YCX 

= (1*1 5 eD S F(Z) (expanding o and () 
YCX ZCY 

=(-1)"! 5 f(z). So eD” (reordering terms) 
ZOX ZCYCX 

= f(X)+(-1)*! >, F(Z). 5 (-1)!¥! (separating Z = X) 

ZOX ZCYCX 
sums to 0 


= f(X). 


The inner sum in the penultimate line is equal to 0 because every nonempty 
finite set A has the same number of subsets of odd and even cardinality (here 
we consider A = X \ Z). To see this, pick any a € A. We partition all subsets 
of A into pairs: every subset S C A\{a} can be paired up with the set SU{a}. 
Each such pair has one set of even size and one set of odd size, which proves 
the claim. 

The second part of the theorem follows by Çu = GoGo = opooga = 
opuco = oo = id. 














Let us revisit the CHROMATIC NUMBER problem, now using the trans- 
forms. For X C V(G), let f(X) be the number of tuples (1,...,[,), where 
I, are independent sets in G and eS, I; = X. Then, for every X C V(G), the 
value f(X) is the number of covers of X by k independent sets. Observe that 
(CIX) = ML ycx f(Y) is the number of tuples (1,...,1,), where I; are in- 


dependent sets in G and Hi I; C X. Using the notation from Section|10.1.3 
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polynomial time after a 2"n°()-time preprocessing, as in Section By 
the inversion formula (Theorem|10.11), we have f(V(G)) = (u¢f)(V(G)) and 
hence we can compute the values of f from ¢f directly using the definition 
of u in time 2”"n°). By Observation graph G is k-colorable if and only 
if f(V(G)) 4 0. This re-proves Theorem 

As we will see, sometimes it is useful to compute the zeta or Möbius 
transform for all subsets X of V. The algorithm that computes these val- 
ues separately for each subset using the definition takes O(S0ycy 2!*!) = 
Om) (*1)2*) = O((2 + 1)") = O(3") arithmetic operations and evalua- 
tions of the transformed function f. It turns out that this can be done in 
time roughly the same as the size of the output 2”. 


we have (¢f)(X) = s(X)*, and hence the values of ¢f can be computed in 
0.1.3] 










Theorem 10.12 (Fast zeta/Mébius transform). Given all the 2” values 
of f in the input, all the 2” values of Cf and uf can be computed using 
O(2” -n) arithmetic operations. 


Proof. Consider the zeta transform. Let V = {1,...,n}. Instead of consider- 
ing f to be a function on subsets of V, it will be convenient to interpret f as 
an n-variable function on {0,1}. We let f(a1,...,a%n) be f(X), where X CV 
is the set with characteristic vector (£1,..., £n), that is, i € X if and only if 
x; = 1. With this interpretation, the zeta transform is defined as 


(CF) Tista) = 5 [y1 < Bigs sy S En]: FUG, oss) 


(recall that the value of the bracket is 1 if the expression in the bracket is 
true and 0 otherwise). 
Consider fixing the last n — j bits, that is, we define 


Gressan) = > [yi < Tiree Yy < ralf (Y -Yj Ejtan) 
< 
Y1,- Yj € {0,1} fixed 


Consistently, denote ¢o(#1,...,2n) := f (£1, .--, £n). Observe that ¢,(X) = 
(¢f)(X). The values of Qj (£1,..., £n) for all j € {1,...,n} and z1,..., £n € 
{0, 1} can be found by dynamic programming using the following formula. 


Gja liyer tn) when z; = 0, 
G (tsrs En) = Cfo (Bien Vi tjhe in) F 
Ring ay Opa Be) when x; = 1. 





Indeed, if x; = 0, then every nonzero term has yj = 0 (and hence y; = z3), 
thus we may consider x, fixed as well. On the other hand, if x; = 1, then 
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we need to consider two types of terms: those with y; = 1 and those with 
yj; = 0. Therefore, by fixing the value of the j-th coordinate to 1 and 0, 
respectively, we get the contribution of the two types of terms. We see that 
the computation takes O(2” - n) arithmetic operations. The claim for the 
Möbius transform follows now by Proposition [10.10] 














In Theorem [10.12] we state the time bound only by means of the number 
of arithmetic operations. The precise asymptotic bound for the time depends 
on the model of computations used, e.g., answers to questions like “how long 
does it take to address an n-dimensional array?” or “what is the cost of an 
arithmetic operation?” (note that the algorithm computes sums of 2(2”) 
input numbers). However, if an arithmetic operation on two input numbers 
(values of function f) can be carried out in time polynomial in n, then we 
can safely claim a 2"n°) upper bound on the running time. 

Using the fast Möbius transform, in the last step of the coloring algo- 
rithm described in this section, instead of computing just (u¢f)(V(G)) we 
can compute f(X) = (u¢f)(X) for all X C V(G). By Observation [10.7] for 
every X C V(G) the graph G[X] is k-colorable if and only if f(X) 4 0. It 
follows that in time 2°n° we have found all k-colorable induced subgraphs 
of G. 


10.3 Fast subset convolution and cover product 


In this section, we consider binary operations on functions on the subset 
lattice. The subset convolution of two functions f,g: 2” > Z is a function 
(f *g): 2” > Z such that for every Y C V, 


FDY) = X F(X) g9(¥ \ X). 


XCY 


Note that equivalently, 


(f*9(¥)= D> F(A)g(B). 
AUB=Y 
ANB=0 
The cover product of two functions f,g: 2” — Z is a function (f *e 
g): 2V + Z such that for every Y C V, 


FDY) = X FAB). 


AUB=Y 


Subset convolution and cover product are computed in many algorithms. 
Perhaps the most natural application in parameterized complexity is accel- 
erating the dynamic programming over tree decompositions, and we will see 
such examples in Chapter [11] At the end of this section, we motivate subset 
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convolution by presenting an algorithm that counts proper colorings for all 
induced subgraphs of a graph. 

Now let us focus on algorithms for computing (all the 2!”! values of) the 
subset convolution and the cover product. The naive algorithms following 
from the definition require O(3") and O(4”) arithmetic operations, respec- 
tively. Indeed, for the subset convolution the number of additions and multi- 
plications is proportional to the total number of pairs of sets X and Y such 
that Y C X C V, which equals Xpo (7)2* = 3” by the binomial theo- 
rem. For the cover product, we consider all the triples (A, B, Y) such that 
A,B C Y C V. There are 2” . 2” = 4” such triples, since every choice of 
subsets A, B C V determines the triple. 

In what follows, we show that the subset convolution and cover product 
can be computed much faster, i.e., in time 2"n°). (Note that this is the size 
of the input, up to a polynomial factor.) 

The pointwise product of two functions f,g: 2” > Z is a function (f - 
g): 2V + Z such that for every Y C V, 


(F-D) = FY): 9Y). 


Clearly, the pointwise product can be computed in O(2”) arithmetic opera- 
tions. This motivates the following lemma. 


Lemma 10.13. The zeta transform of the cover product is the pointwise 
product of the zeta-transformed arguments, i.e., C(f *eg) = (Cf) - (Cg). 


Proof. For every Y C V we have, 


(fse) = J J fAg(B= $ FAB) 


XCY AUB=X AUBCY 


=| X F(A] | $ 9B) | = CP) Co). 


ACY BCY 














A reader familiar with the classic Fourier transform considered in cal- 
culus will certainly recognize in Lemma [10.13]an analogue of one of the 
most fundamental properties of the Fourier transform: the zeta trans- 
form translates convolution into point-wise multiplication. By the in- 
version formula we get a backward translation by means of the Mobius 
transform. Formally, we have 


f*eg = UCF) - (C9). 
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Using Lemma |10.13] we now give an algorithm for computing the cover 
product. 


Theorem 10.14. For two functions f,g: 2” — Z, given all the 2” values 
of f and g in the input, all the 2” values of the cover product f *.g can be 
computed in O(2” - n) arithmetic operations. 


Proof. Similarly as in the example with coloring using ¢ and p-transforms, 
by Lemma the ¢-transform of the desired result is easy to compute. 
More precisely, all the 2” values of ¢(f xe g) can be found by computing all 
the 2” values of Çf and all the 2” values of Cg, and finally multiplying the 
results for every Y C V. By Theorem [10.12] this takes O(2” - n) arithmetic 
operations. By the Inversion Formula (Theorem 10.11}, we can compute f*eg 
by applying the Möbius transform to ¢(f xe g). This is done again in time 


O(2" . n) using Theorem |10.12 


We now show how to use the algorithm for computing the cover product to 
compute the convolution. The key idea is to consider separately, for every 
0 <i < |X|, the contribution of those terms of f x g where |A| = i and 
|B] = |X|- i. 














Theorem 10.15 (Fast subset convolution). For two functions f,g: 2” > 
Z, given all the 2” values of f and g in the input, all the 2” values of the 
subset convolution f xg can be computed in O(2" . n?) arithmetic operations. 


Proof. Let us introduce the following notation which truncates a function to 
sets of a fixed cardinality: 


fx (S) = £(S) - [|S] = k]. 


The key to the fast subset convolution computation is the observation that 
disjointness can be controlled using cardinalities, as follows. 


(fxg(X)= X f(A)g(B) 

AUB=X 

ANB=0 

|X| 
=- E stags) 
i=0 AUB=X 

|Al=i]B[=1X|-i (10.3) 

|X| 
=> 5 fi(A) gx \-i(B) 
i=0 AUB=X 

|X| 


= Shi že 9\x|-i)(X). 
i=0 


Note that the second equality holds, since if A and B form a partition of X, 
then clearly |A| = i and |B| = |X| — i for some 0 < i < |X]. Our algorithm is 
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as follows. First, for all i, j € {0,...,n} and X C 2” we compute and store 
(fi *e 9;)(X). By Theorem it takes O(2” - n3) operations. Then we can 
compute (f x g)(X) for all X C 2” using the identity in O(2” - n) 
operations. 














10.3.1 Counting colorings via fast subset convolution 


In this section, we motivate the notion of subset convolution by an algorithm 
that counts the number of k-colorings in every induced subgraph of the input 
graph G. Note that we interpret a k-coloring as a function 2” (©) — [k] and we 
count the number of distinct functions that are valid k-colorings. This means 
that we consider two colorings to be distinct (and count both of them) if they 
partition the vertices into color classes the same way, but the colors of the 
classes are different. 

Define a function s: 2Y(C) — {0,1} such that for every X C V(G) we 
have s(X) = 1 if and only if X is an independent set. Then 


(s* 5 *++-*8)(X) 


k times 


is the number of k-colorings of G[X] (note that a k-coloring needs not use all 
the k colors). It can be found in time and space Pnl) by applying the fast 
subset convolution algorithm (Theorem |10.15) k — 1 times (or even O(log k) 
times, if we apply a fast exponentiation algorithm, that is, we iteratively 
compute the 2?-th power by multiplying the 2’-th power with itself). We 
have proved the following theorem. 


Theorem 10.16. The number of k-colorings in every induced subgraph of 
the input graph G can be computed in time and space 2°n°™, 


10.3.2 Convolutions and cover products in min-sum 
semirings 


Recall that a semiring is an algebraic structure similar to a ring, but with- 
out the requirement that each element must have an additive inverse. More 
precisely, a semiring is a triple (R,+,-), where R is a set and + and - are 
binary operations called addition and multiplication. Both operations are as- 
sociative, addition is commutative, multiplication distributes over addition, 
i.e., a: (b+c) = (a -b)+ (a c) and (b + c)va = (b- a) + (c- a). Finally, there 
are two elements 0,1 € R such that 0 is the additive identity and annihilates 
R, i.e., a- 0 = 0, while 1 is the multiplicative identity. In a ring, additionally, 
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for every element a there is an element 6 (called an inverse of a) such that 
a+b=0. 

In applications we frequently need to compute convolution-like expressions 
of the form 


(f*9)(Y) = min (f(A) + 9(B)), 
ANB=0 


or 


(f*eg)(¥) = „min (f(A) + 9(B)). 


At first glance reusing the subset convolution and cover product notation 
here seems confusing. However, one can notice that in fact these are the 
subset convolution and the cover product for functions f and g with values 
in the integer min-sum semiring, i.e., the semiring (ZU {+00}, min, +). (Note 
that +oo is the additive identity in this semiring.) Similar expressions with 
max replacing min might also be of interest and then they correspond to the 
subset convolution and the cover product in integer maz-sum semiring, i.e., 
the semiring (Z U {—oo}, max, +). 

Note that in the previous sections we were working in the (Z, +, -) ring}| 
and in fact the fast cover product algorithm from Theorem and the 
fast subset convolution algorithm from Theorem [10.15] are correct for any 
ring. These algorithms rely on the fast zeta and Möbius transforms from 
Theorem[10.12] We encourage the reader to verify that the fast zeta transform 
algorithm works perfectly fine in a semiring; however in the definition of the 
Möbius transform the additive inverse is used, hence the Möbius transform 
cannot even be defined in a semiring. Hence we cannot apply the fast cover 
product and the fast subset convolution algorithms in the (ZU{+oo}, min, +) 
semiring, since integers do not have inverses under the min operation. It is 
unclear whether we can extend these algorithms to semirings. However, if f 
and g have small integer values we can use a simple embedding trick to get 
the following result. 


Theorem 10.17. For two functions f,g: 2Y + {—M,...,M}, given all the 
2” values of f and g in the input, all the 2” values of the subset convolution of 
f and g over the integer min-sum (or maz-sum) semiring can be computed in 
time 2°n°) . O(M log M loglog M). The same holds for the cover product. 


Proof. Let B = 2” +1. We define two functions f’, g': 2” — {0,...,8?} by 
putting f/(X) = BM@t+fOO and g/(X) = BM+9™), Then, 





1 We would like to emphasize that in this section the role of the + operation changes: in 
the min-sum semiring, + plays the role of the multiplicative operation. 
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(fxg (VY) = Y PAY AX) 
XCY 
_ D B2M+FOO+IY\X) — S opti, 
XCY i=—2M 


where a; is the number of sets X such that f(X) +g(Y \ X) =i. Note that 
a; € {0,...,2!”1} and 2!¥! < 2” < 8, hence the values of a; can be extracted 
from the value of (f’ * g’)(Y), as we can interpret the a;’s as digits in base-6 
notation. Then, (f x g)(Y) = min{i : a; > O}. 

It follows that f *g over the min-sum semiring can be obtained from f’ xg’ 
computed over the standard integer ring. By Theorem [10.15] this takes O(2”- 
n?) arithmetic operations on O(Mn)-bit integers, and each of them takes 
time O(Mnlog(Mn) log log(Mn)) if the Schénhage-Strassen multiplication 
algorithm is used. The proof for the cover product is analogous, and it 
is left as Exercise [10.12] 














Let us conclude this section with a simple application (see Section 
for another one). Consider a variant of the CHROMATIC NUMBER problem, 
where we are given an undirected graph G, an integer k and a cost function 
c: V(G) x [k] > {-—M,...,M}. The cost of a coloring x: V(G) — [k] is 
defined as ”`,ey(c) C(v, x(v)). In other words, coloring a vertex v € V(G) 
with a color i € [k] has cost c(v,i). We want to determine the minimum 
cost of a_k-coloring of G (provided that such colorings exist). Similarly as in 
Section for every color i € [k] let us define a function s;: 2”(9) > 
ZU {+00} such that for every X C V(G) we have 


(X) Sex cl(z,i) when X is an independent set, 
Si E 
' +00 otherwise. 


Then, 
(81 * 8g *+++ * 8%)(X) 


evaluated in the min-sum semiring is the minimum cost of a k-coloring 
of G[X]. All these costs for all subsets X C V(G) can be found in time 
2"nP M log M log log M by applying Theorem k — 1 times (or even 
O(logk) times, as noted in Section [10.3.1). We have proved the following 
theorem. 


Theorem 10.18. Given a graph G, an integer k and a cost function c: V(G)x 
[k] > {—M,...,M} one can compute in time 2°n°™ M log M log log M the 
minimum, cost of a k-coloring of every induced subgraph of G. 
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10.4 Multivariate polynomials 


In this section, we are going to revisit the LONGEST PATH problem introduced 
in Section Using randomization in a different way and with the help of 
algebra we will get a significantly faster FPT algorithm. It is convenient 
here to abuse the standard notation and by k-path we mean a path on k 
vertices, while k-walk is a sequence of vertices v,,..., vz such that neighboring 
vertices in the sequence are adjacent. Let us recall that in the LONGEST PATH 
problem, given a graph G and an integer k, we ask whether there is a k-path in 
G. We begin with recalling two tools that play a crucial role in this approach. 

Our first tool is the use of finite fields. Let us recall that a field is a triple 
(F,+,-), where F is a set and + and - are binary operations called addition 
and multiplication. Both operations are associative and commutative and 
multiplication distributes over addition. There are two elements 0,1 € F 
such that 0 is the additive identity and 1 is the multiplicative identity. Finally, 
every element of F has an additive inverse, i.e., for every a € F there is an 
element b € F such that a +b = 0; and every element of F \ {0} has a 
multiplicative inverse, i.e., for every a € F \ {0} there is an element b € F 
such that a-b = 1. The most common examples are the fields of rational, real 
and complex numbers. However, here we focus on finite fields, in particular, 
finite fields of size 2° for some integer s. 

It is known that for every prime p and integer s > 1, there is a unique 
finite field (up to isomorphism) of size p°. Finite fields of size p for a prime 
p are easy to construct: we can simply take the set {0,1,...,p — 1}, with 
addition and multiplication modulo p. Constructing fields of size pê for s > 2 
is more involved and describing these fields in detail is beyond the scope of 
this book. However, here we need to know just two basic facts about fields of 
size 2°: 


e We can represent elements of a field of size 2° as bit vectors of length 
s and perform field operations (addition, subtraction, multiplication, 
division) on these bit vectors fast, i.e., in time O(s log s log log s). 

e These fields are of characteristic 2, i.e., 1 + 1 = 0. In particular, for 
any element a, we havea+a=a-(1+1)=a-0=0. 


The simplest field of characteristic 2 is the familiar GF(2), which corre- 
sponds to the set {0, 1} with addition and multiplication modulo 2. In general, 
if there is a field of size q € N, then there is exactly one such field (up to 
isomorphism) and it is denoted by GF(q). 

We will be working with multivariate polynomials such as 


2 
C1£1 £3 + coz3r3 rags + C3, 
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interpreted over a finite field F; here c1, c2, cz are constants from the field F. 
Formally, recall that a (multivariate) monomial is an expression of the form 
m = axi zy --- xo", where a is an element of a ring F (in our case it will be a 
finite field), 21,...,2,, are variables and c1,...,Cn are nonnegative integers. 
Note that the monomial m is a formal object represented by the sequence 
(a,C1,---,€n). The degree of m is defined as X`; cn. Monomials with the 
same power sequences (c1,...,Cn) can be added using the distributive law, 
C1 „C2 


H C. C1 „C2 C A C1 C2 C 
i.e., az ay? o a + batt as? o ag = (at bjati as? o aor. 


A polynomial is a finite sum of monomials, i.e., an expression of the form 


p= D Qoen ti Be on “De (10.4) 
(€1,-+.,Cn) €(NU{O})” 


where the coefficients a.,,...-,, are nonzero only for a finite number of tu- 


n 


ples (c1,...,Cn)- We can then define addition and multiplication on these 
formal objects in the usual way. In such a way we get a ring, denoted by 
F[|z1,..., £n]. This ring has the (additive) zero element, namely the element 


where all the coefficients ac, ,.c, are equal to 0. The corresponding polyno- 
mial is called the zero polynomial, or the identically zero polynomial. The 
degree of the polynomial p is defined as the maximum degree of its mono- 
mials in the expansion (10.4), i.e., max{} 1 Ci? Gey,...,c, # 0}; the zero 
polynomial has no degree. 

As described above, a polynomial, which is a formal object, can be repre- 
sented as a set of tuples of the form (a,c1,...,¢n). We can also represent a 
polynomial as an arithmetic circuit, with addition and multiplication gates, 
where the input gates correspond to variables and coefficients. The circuit 
corresponds to an expression which, by applying the associativity of addition 
and distributivity of multiplication over addition, can be reduced to an equiv- 
alent expanded expression which is a sum of monomials, like (10.4). However, 
note that the size of the expansion can be exponential in the size of the origi- 
nal expression /circuit, e.g., the expansion of the expression [J"_, (x; +1) has 
2” monomials. This is one of the crucial properties we exploit here: in the 
polynomials we are going to use, the set of monomials corresponds to a huge 
search space (e.g., the set of all k-vertex paths), but the whole polynomial 
can be represented by a much smaller circuit. 

A polynomial p on variables 71,...,2, can be evaluated in a given tuple 
(Z1,...,%n) of elements from the field F. By p(%#1,...,%n) we denote the 
value of an expression representing p where for every i € {1,...,n} the 
value £; substitutes the variable x;. Note that the arithmetic circuit that 
represents the polynomial corresponds to an algorithm that computes the 
evaluation (and the running time of the algorithm is linear in the number 
of gates of the circuit). A polynomial p defines a function fp: F” —> F in 
a natural way, i.e., fp(@1,...,@n) = p(%1,...,¥n). It is however important 
to note that if a function f corresponding to a polynomial in F[|z1,..., £n] 
evaluates to 0 for every argument (i.e., is the zero function), it does not imply 
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that the polynomial is the identically zero polynomial (while the converse 
obviously holds). Indeed, consider for example F = GF(2) (with addition and 
multiplication modulo 2) and the polynomial x + 2? (see also Exercise[10.14). 
Nevertheless, we note that for every function f: F” — F there is at most 
one polynomial p of degree smaller than the size of the field F such that for 
every argument (%1,...,%n) € F” we have f(%1,...,%n) = p(£1,.--,¥n) (we 
ask the reader to prove a slightly more general version of this statement in 
Exercise (10.14). 

Our second tool is the following claim, called the Schwartz-Zippel lemma. 
It is well known that a nonzero univariate polynomial of degree d over the reals 
has at most d roots, thus the roots are indeed very sparse. For multivariate 
polynomials, we may have arbitrarily many roots; consider, for example, the 
polynomial 7122. The following lemma shows (in a discrete setting) that roots 
of multivariate polynomials are sparse: when we randomly sample the values 
of the variables from a finite subset of the field, the probability of finding a 
root is small. 


Lemma 10.19. Let p(£1,£2,..., £n) E F|£1,..., £n] be a polynomial of de- 
gree at most d over a field F, and assume p is not identically zero. Let S be a 
finite subset of F. Sample values aj, a2,...,an from S uniformly at random. 
Then, 

Pr(p(a1,a2,..., an) = 0) < d/|S|. 


Proof. We use induction on n. For n = 1, the claim follows from the well- 
known fact that a nonzero univariate polynomial of degree at most d over a 
field F has at most d roots. This can be proved by the polynomial remainder 
theorem, which states that a polynomial p in a single variable x has a root r 
if and only if polynomial « — r divides p in the ring F'[z]. This implies that 
a polynomial of degree at most d can have at most d different roots. 

Now consider n > 1. Let us rewrite the polynomial p as 


d 
Dia. ty) = a - Pil£2,..., En). 
i=0 


Let k be the largest index i such that the polynomial p; is not identically 
zero; such a value k exists, since p is not identically zero. Note that pp is of 
degree at most d — k. For sampled values of az,...,@,, define a univariate 
polynomial in zı: 


k 
g(a) = a - pi(G2,---, Qn). 
i=0 


Let E; be the event “q(a1) = 0” and let E2 be the event “pp(az,..., an) = 0”. 
By Ez we denote the complement of €2. Then, 


Pr(p(ai, A2,++-, an) = 0) = Pr(£1) 
= Pr(£ N E2) + Pr(£ N E2). 
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By the induction hypothesis, Pr(€; N E2) < Pr(E2) < or. Hence it suffices 
to show that Pr(€, N £2) < if: When Pr(€) = 0 the claim clearly holds. 
Otherwise, Pr(E, N £2) = Pr(E, | E2) Pr(E2). Hence it suffices to show that 
Pr(& | E2) < ai Indeed, the event €2 depends only on the values of ag, ..., 
an and for any fixed ag, ..., an such that Ez is not true, q(x1) is a univariate 
polynomial that is not identically zero (as x has a nonzero coefficient). This 
means that g(x) is of degree exactly k. Therefore, fixing a2, ..., an such 
that €2 is not true, there are at most k values of a; for which q(ai) = 0, 
hence we can bound the conditional probability Pr(& | E2) by Er 














A typical algorithmic application of Lemma has the following 
scheme. We can efficiently evaluate a polynomial p of degree d, and we 
want to test whether p is a nonzero polynomial. Then, we pick S so 
that |S| > 2d and we evaluate p on a random vector s € S”. We answer 
YES if we got p(s) Æ 0, and otherwise we answer NO. If p is the zero 
polynomial, then we always answer NO, and otherwise we answer YES 
with probability at least Z. Hence we get a Monte Carlo algorithm with 
one-sided error. 


For example, assume we are given two functions f,g: F — F, for a field 
F. The functions are given as (arbitrarily complicated) arithmetic circuits 
with only addition, subtraction and multiplication gates, and we want to 
test efficiently whether f and g are equal. Note that f and g correspond to 
polynomials represented by the circuits. Hence, the function f— g corresponds 
to a polynomial p. We can easily bound the degree of p by a maximum number 
d of multiplication gates on a leaf-to-root path in the circuits representing 
f and g. We evaluate f — g in a random element of a finite subset of the 
field F larger than 2d, and we report that f = g if and only if we get 0. If 
f = g, then we always get 0 and the answer is correct. Otherwise, p is not 
the identically zero polynomial, so we answer NO with probability at least 
Z. Note also that by repeating the algorithm, say logn times, we can lower 
the error probability to O(4); see Chapter 

The problem described above is usually called POLYNOMIAL IDENTITY 
TESTING, and the Schwartz-Zippel lemma provides an efficient randomized 
algorithm solving it. Actually, no deterministic analogue of this algorithm is 
known, and providing such is considered a major open problem. 


10.4.1 LONGEST PATH in time 2*n° 


We proceed to show the following theorem. 
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Theorem 10.20. There is a one-sided error Monte Carlo algorithm with 
false negatives that solves the LONGEST PATH problem for directed graphs 
in time 2*n° and polynomial space. 


The general idea is clear: we want to find a polynomial P such that P 
is nonzero if and only if the input graph G contains a k-path. What should 
P look like? Assume that for every edge e of graph G we have a variable 
Ze, for every vertex v of G we have a variable y,, and P is a function of 
all variables in {ze : e € E(G)}U{yw : v € V(G)}. The first idea 
could be to have a monomial for every k-path in the graph. However, since 
counting k-paths is known to be W[1]-hard, at the first glance evaluating such 
a polynomial fast seems infeasible: we could evaluate this polynomial over a 
large field, say, the field of rationals, substituting 1 for every variable. As we 
will see, it is possible to define such polynomial, so that we can evaluate it 
fast over finite field of characteristic 2. However, to get to this point, let us try 
something different. Counting walks is easy, so assume that for every k-walk 
U1, €1,U2---,;€k—1, Uk in the graph we have the monomial fh: Le mI Yv; 
Then we can expect efficient evaluation; however, instead of just paths, our 
polynomial will also contain monomials for many unwanted walks. In other 
words, instead of checking whether the polynomial is nonzero, we need to 
verify whether it contains a multilinear monomial (i.e., a monomial where 
every variable has degree at most 1) — such monomials are exactly the ones 
that correspond to simple paths. 

To overcome this difficulty, we extend our polynomial by considering 
labelled walks, i.e., we introduce a monomial for every pair (W,?), where 
W =,...,Uz is a k-walk and £: [k] — [k] is a bijection. Thus, every vertex 
vi gets a unique label (more precisely, if W visits a vertex v several times, 
then every visit of v gets a unique label). It turns out that the monomials 
corresponding to the non-path walks cancel out over fields of characteristic 
2. We now proceed to formal argumentation. 

Our final polynomial P has variables of two kinds. For every edge (u,v) € 
E(G) there is a variable x,,,, and for every vertex v € V(G) and a number 
£ € [k] there is a variable yy. The vectors of all z- and y- variables are 
denoted by x and y, respectively. We define 


k-1 k 
P(x,y) = 5 5 II Cu; viy II Yo; l(i): (10.5) 
i=l 


walk W = v1,...,Up £: [k]>[k] i=1 


£ is bijective 


We treat the polynomial P as a multivariate polynomial over the field 
GF(2!!08(4")1). For labelled walk (W, £), we define the monomial 


k-1 k 
mony ,e(x, y) m Il Lui vidi II Yo, l(i): 
i=1 i=l 
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Let us show the cancelling property of P. 
Lemma 10.21. 


P(x,y) = 5 5 monwye(x, y) 


path W = v1,..., vk £: [k]>[k] 
£ is bijective 


Proof. Let W = v1,..., Up be a walk, and let £: [k] — [k] be a bijection. 
Assume Va = Up for some a < b, and if there are many such pairs, then take 
the lexicographically first one. We define 7’: [k] — [k] as follows: 


Lb) ifa=a, 
l(x)= $ La) ifx=b, 
L(x) otherwise. 


By the definition of @’, we have that (a) = V(b) and (b) = (a). Since 
Va = Up, this means that yv, e(a) = Yv, (b) ANd Yr, e(b) = Yva L(a): Note that 
(W,£) 4 (W, 2’) since £ is injective. However, the monomials corresponding 
to (W, £) and (W, 7’) are the same: 


k— k 
monw, = II Tu; vita II Yu; l(i) 
i=1 i=1 


R= 
= II Ly; vigi II Yu; li) ` Yva, L(a) Yup l(b) 
i=1 i€[k]\{a,b} 








pa 
= II Tvi, vipi II Youre! (i) Yop, L (b) Yuval (a) = MON We. 
i=1 i€[k]\{a,b} 


By defining ¢’, we have defined a mapping that maps a pair (W, £) to a pair 
(W, L) such that the monomials mony,¢ and mony, are identical. Applying 
this mapping to (W, l’), we get (W, £) back: we again swap the values of ¢’(a) 
and Z’ (b). This means that we have paired up all the labelled non-path walks 
and for every such pair the two corresponding monomials are identical, so 
they cancel out because the field is of characteristic 2. 














Now, let W be a path and let £: [k] — [k] be a bijection. Consider the 
monomial mony, = ii E SE IŁ, Yv: eti- We see that since W is a 
path, from monw we can uniquely recover (W, £) and hence there are no more 
labelled walks that correspond to mony. Hence the monomial corresponding 
to (W,£) does not cancel out, implying that P is a polynomial that is not 
identically zero. Together with Lemma this observation implies the 


following. 


Corollary 10.22. P is not the identically zero polynomial if and only if the 
input graph G contains a k-path. 
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The only missing element now is how to evaluate P at given vector (x,y) 
efficiently. At this point we need a generalized, weighted version of the 
inclusion—exclusion principle. We state it below with a proof, which is al 
most the same as in the unweighted version (Theorem (10.1). If R is a ring 
and w: U > Risa weight function then the weight of a set X C U is defined 


as w(X) = Zex W(2). 


Theorem 10.23 (Weighted inclusion—exclusion principle). Let A,..., 
An be a family of sets of a finite set U. Denote (),-g(U \ Ai) =U. Then 


w{() 4] = >X ep~. w (qewa): (10.6) 


i€[n] XC[n] iE X 


Proof. Consider any element e € U. Let k be the number of sets A; that 
contain e and let A;,,..., Ai, be these sets. Note that e € (\,-x(U \ Ai) if 
and only if X A {i1,..., ip} = 0. Hence w(e) appears in the right-hand side 
of (10.6) multiplied by 


n—k 
So rs") yeap = [=a] 


XC[n]\ {i2,--,tn } 


This coincides with the contribution of w(e) to the left-hand side of (10.6). 














Let us fix a walk W. Define the universe U as the set of all functions 
£: [k] — [k] and for every £ € U define the weight w(¢) = monw,e(x, y). 
Finally, for i € {1,...,k} let A; = {L€ U : €-1(i) Æ 0}, that is, the set of 
those mappings that contain 7 in their images. Then Niet] A; is the set of 
all surjective mappings. A mapping £: [k] — [k] is surjective if and only if it 
is bijective. Therefore, we have 


k 
5 mony¢(x, y) = 5 mony,e(x,y) = w (A a) . (10.7) 


£: [k]>[k] £: [k]>[k] i=l 
£ is bijective £ is surjective 


Furthermore, by (10.6), 


344 10 Algebraic techniques: sieves, convolutions, and polynomials 


k 
w (Qa) = E p.w (nesa) oy (OH) 


iEX 


be 
N 
Eo 


iEX 


be 
N 
= 


lage 


= w ( N (U \ 4) (characteristic 2) 
(10.8) 


5 monwy,e(X, y) 


k] £: [k]=>[k]\X 


be 
N 


M | 


>. monw,(x,y). (reordering terms) 
k] £: [k] >X 








be 
IN 


In the second equality, the (—1)!*! term disappears because —1 = 1 if we 
use a field of characteristic 2 (this is not crucial; it just simplifies the formulas 
a bit). In the third equality, we observe that ();-.(U \ Ai) contains those 
labellings £ whose images are disjoint from X. Combining Lemma [10.21] with 
(10.7) and (10.8), we get 


5 5 mony,e(x, y) (Lemma 


path W £: [k]>[k] 
£ is bijective 


= 5, bD 5 mony,e(x,y) (by (10.7) and (10.8) 


walk W XC[k] £: [kK] 3X 


= 5 yy > monw,(x,y). (reordering terms) 


XC[k] walk W £: [k] >X 


P(x,y) 


(10.9) 
For a fixed X C [k], let us denote Px (x, y) = X wax w Que: [k] >x MOnw,e(X, y). 
From , it now follows that evaluating P boils down to 2% evaluations of 
N Px. As the following lemma shows, this is a standard dynamic- 
programming exercise. Note that the dynamic programming works in polyno- 
mial time unlike many other dynamic-programming algorithms in this chap- 
ter. 


Lemma 10.24. Let X C [|k]. The polynomial Px can be evaluated using 
O(km) field operations. 


Proof. Using dynamic programming, we fill a two-dimensional table T, de- 
fined as follows 


Tlv, d] = 5 5 I Tvi, Vi41 To 


walk W = 1,...,va £: [dj >X i=1 
vı =V 


Then, 
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5 Yv, when d = 1, 
Tly.d) = } “x 
[v, d] 5 Yot 5 Zo w T[w,d— 1] otherwise. 
LEX (v,w)EE(G) 
In the above recurrence we check all the possibilities for the label of v and 
for the second vertex w on the walk. Hence, Px (x,y) = 5 T[s, k] can be 
sEV (G) 














evaluated using O (km) field operations. 


It follows that the value of P for a given assignment of its variables can 
be evaluated using O(2*km) field operations. The algorithm now proceeds 
as follows. We evaluate P over a vector of random elements from the field 
of size 2lles(4*)] > dk, and we return NO if we get 0; otherwise we return 
YES. Since P is of degree at most 2k —1, the Schwartz-Zippel lemma tells us 
that if there is a k-path, i.e., P is nonzero, then the evaluation returns zero 
(and we report a wrong answer) with probability at most 1/2. This proves 
Theorem 


Let us stop for a while and think about the connections between the 
2*nO algorithm above and the (2e)*n°“) color coding-based algorithm 
from Section In both approaches, we are fighting against collisions: we 
want to consider only those walks where every vertex appears at most once. 
In the color coding approach, we guess a coloring of vertices in k colors, hop- 
ing that vertices of a k-path get k different colors. Testing whether such a 
colorful path exists was done using dynamic programming over all subsets 
of the set of colors. However, it can be also done in polynomial space using 
the inclusion—exclusion principle — this is the topic of Exercise If the 
reader solves this exercise, then it will turn out that for every set of colors 
X, it requires counting the k-walks that have vertices colored with colors 
of [k] \ X only. This in turn can be done by dynamic programming very 
similar to the one that was used to evaluate polynomial Py, where labels 
correspond to colors. It seems that, thanks to the algebraic tools, we were 
able to get rid of the need for sampling the required coloring (or, in a way, 
we iterated over the whole probabilistic space almost for free). Indeed, there 
are more examples of problems that were first solved by color coding, and 
then faster algorithms with the techniques from this chapter were developed 
(see Exercises [10.19] and [10.20). 

Finally, we remark that the use of the inclusion—exclusion formula for 
getting rid of the bijectivity requirement is not the only way to obtain the 
running time of 2*n°“), Perhaps a simpler solution is to use dynamic pro- 
gramming, and compute the entries of the following table, for every subset 
X C [k] and vertex v € V(G). 
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|X|-1 |X| 
T[X, v] = 5 5 II Lu, vig TENG 
walk W = v1,... vx] £: [[X|]—>[k] i=1 i=1 
use e((k) =X 


(The reader can notice yet another connection with color coding here; in- 
deed, this dynamic programming resembles determining the existence of 
a colorful path in Lemma [5-5]) Notice that after computing table T the 
evaluation of polynomial P is obtained easily using the formula P(x,y) = 
ovev(c) TIlk], v]. However, such an algorithm uses exponential space. 


10.4.2 LONGEST PATH in time 2*/?n° for undirected 
bipartite graphs 


In this section we deal with an undirected bipartite graph G with bipartition 
V(G) = Vi U V2. Our goal is to accelerate the algorithm from the previous 
section to time 2*/2n° = 1.42'nO). For simplicity, assume that k is even. 
Recall that the exponential factor in the running time of that algorithm 
depends on the number of labels used only. Can we use just k/2 labels? An 
obvious attempt in this direction is to label only those vertices of the walks 
that are in V,. This simple idea almost works. Our new polynomial would be 
the following: 


k-1 k/2 
Qo(x,y) = 5 5 II i eg II Yur:1,e(i)- (10.10) 
walk W = v1,..., Uk L: [k/2]3[k/2] t=1 t=1 
viEVı £ is bijective 


. k-1 k/2 
Again, let us denote mony, = Į [i Darii is Yozi) 


There are three differences between the above polynomial and the polyno- 
mial of (10.5). First, a minor one, is that we sum over walks that begin in Vj. 
Since G is bipartite and k is even, every path on k vertices has one endpoint 
in Vı and the other in V2, so we do not lose any path. Second, the labelling 
bijection has domain/co-domain of size k/2. The meaning of this is that for 
a pair (W, £), for every i € {1,...,k/2}, the vertex v2;_1 gets label ¢(i). Note 
that all these vertices lie in V,, while the other vertices of the walk are in Vo. 
Third, since now we are working with undirected graphs, the edge variables 
are indexed with unordered pairs of vertices (SO Guy = Zvu). 

Observe that, as before, for a path W and bijection £: [k/2] > [k/2], we 
can uniquely recover (W, £) from mony», hence the monomial corresponding 
to (W,£) does not cancel out. It remains to show that non-path walks do 
cancel out. Consider a non-path labelled walk (W, £), W = v1,..., Up. Since 
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ġo, z 


Vi = Vj Vi = Vj 


Fig. 10.2: The problematic case. The white vertex belongs to V; and the grey 
vertex belongs to Vz 


W is not a path, there exist some pairs (i, j) such that i < j and v; = vj. Let 
us call such pairs bad. There are two cases to verify. 


Case 1. There exists a bad pair (i, j) such that v; = v; € Vi. If there are 
several such bad pairs (i, j), we pick the lexicographically first one. Note that 
both v; and v; have labels, so we can get a different labelling ¢’ by swapping 
their labels, i.e., swapping the values of ¢(44+) and (4#). The monomials 
mon(W, £) and mon(W, ¢’) are exactly in the same so we can proceed as in 


Lemma [10.21 


Case 2. For all the bad pairs (i,j), we have v; = vj € Vo. Again we pick 
the lexicographically first bad pair (i, j). Now vertices v; and v; do not have 
labels, so we cannot use the swapping trick. The new idea is to use the fact 
that G is undirected and reverse the closed walk v;, v;41,...,v;. Let W” be the 
new k-walk. We also reverse the order of labels on the closed walk, i.e., in the 
new labelling ¢’, for every t € {1,..., 5*}, we put L (4 +t) = (f +1-—t) (so 
that every vertex receives the same label as before), and otherwise ¢’ is equal 
to Z. It is easy to see that mon(W, £) = mon(W’, ¢’) as desired. Moreover, 
if we start from (W’,@’) and follow the same way of assignment, then we 
get (W, £) again. The last thing to check is whether we really create pairs in 
this way, i.e., whether (W’,¢’) 4 (W, £). This is true when the closed walk 
Ui, Vi+1;---, Uj is of length greater than 2, because then it visits Vi at least 
twice and V Æ £ (because £ is injective). However, in the only remaining case, 
when the closed walk is of length 2, 2 = ¢’. Moreover, if we reverse a closed 
walk of length 2 we get exactly the same walk, so W = W” (see Fig. (10.2). 
So the case with a closed walk of length 2 is problematic. 

We see that the polynomial Qo defined in does not. work, but it 
is quite close to what we need. Can we fix it? The most naive thing to do 
is to modify the polynomial to exclude the problematic case. Surprisingly, 
this works. We say that a walk v1,..., Uk is admissibl¢?| when for every i € 
{1,...,k — 2}, if v; € Vo and vj41 € Vı then vj42 4 vi. Define 





2 The condition vj41 € Vi seems redundant here, but we will use the same definition for 


general graphs in Section |*10.4.3 
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k-1 k/2 
Q(x, y) = 5 5 JI teva [voie 0010 
walk W = v1,..., Uk L: [k/2]>[k/2] i=1 i=1 
viEVı £ is bijective 
W is admissible mon(W,é) 


Note that for a path W and bijection £: [k/2] > [k/2], the monomial 
mon(W, £) does not cancel out as before. Also, the proof for Case 1 is valid 
again (we do not change the walk so it stays admissible). In Case 2, we 
also get mon(W, £) = mon(W’, ¢’) and now, since W is admissible, we avoid 
the problematic case and (W’, é’) 4 (W, £). However, we need also to verify 
whether W” is admissible. Let (i,j) be defined as in Case 2. Assume W’ = 
W1,---, Wz is not admissible, i.e., for some t € {2,...,k —1} we have w1 € 
Vo, we € Vi and w1 = w1. Ift < i—1 or t > 7 + 1 then the same 
sequence appears in W, so W is not admissible, a contradiction. Similarly, if 
te {i+1,...,j—1}, then the same sequence appears in W in reversed order, 
contradiction again. Hence t € {i,j}. However, since v, € Vı then Case 1 
would apply, a final contradiction. 


Corollary 10.25. Q is not the identically zero polynomial if and only if the 
input graph G contains a k-path. 


Now it remains to show that Q can be evaluated in time 2*/2n°(). Exactly 
in the same way as in Section |10.4.1| from the inclusion—exclusion principle 
we derive the following 


Q(x, y) = 5 5 5 monw,(x,y). 


XC[k/2] admissible walk W £: [k/2] >X 


For a fixed X C [k/2], we define polynomial 


Qx(x,y)= 5 5 mony,e(x, y). 


admissible walk W £: [k/2] >X 


For every X C [k/2], the polynomial Qx can be evaluated in polynomial 
time, by a standard extension of the dynamic programming described in the 
proof of Lemma [10.24] We leave the details for the reader as Exercise 
(the crux is to keep track of not only the first vertex of the walk, but also of 
the second one). This finishes the proof of the following theorem. 


Theorem 10.26. There is a one-sided error Monte Carlo algorithm with 
false negatives which solves the LONGEST PATH problem for undirected bi- 
partite graphs in time 2*/2n°Q = 1.42'nO™ and polynomial space. 
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f(5) f(4) 


a) F2) f(6) 


Fig. 10.3: Enumerating labelled objects in a walk by function f. The white 
vertices belong to V; and the grey vertices belong to V2 


*10.4.3 LONGEST PATH in time 23*/4n° for undirected 
graphs 


In this section, we extend the algorithm for bipartite graphs to arbitrary (but 
still undirected) graphs, at the cost of worse running time of 234/4n07) = 
1.69*n°). Recall that to improve over the 2*n°\)-time algorithm it suffices 
to reduce the number of labels. We will mimic the bipartite case. We begin 
with choosing a random bipartition V = Vi U V2, i.e., every vertex is assigned 
to Vi or V2 with equal probability 1/2. We use labelled walks again, but this 
time we label not only vertices from V; but also V2V2-edges, i.e., every edge 
that has both endpoints in V2. How many labels should we use? To label a 
walk v1,..., Ug we need: 


e a different label for each i € {1,...,k} s.t. v; € Vi, and 
e a different label for each i € {1,...,k} s.t. vi, vier E€ V2. 


For every i € {1,...,k/2}, either one of {va;-1,v2;} is in Vi or both of 
them are in V2, so we need at least k/2 labels. However, if the walk W lies 
entirely in V, or V2, then we need k or k — 1 labels, and then our approach 
will bring us only to 2*n°“) running time. In order to use less labels, we 
will not consider all the walks. For an integer K, a walk W = v1,..., Uk is 
L-admissible when it is admissible (in the same sense as in Section 
and |{i : v; ©Vit| + [{i : vivi+ı € Vo}| = L. Note that in an L-admissible 
walk, we can label all its visits of Vi-vertices and V2V2-edges using L different 
labels. 

The polynomial we are about to define contains standard variables 2, w 
and ye, that we used in the previous sections, however now in the variables 
Ye,e the index e can be either a vertex or a V2V2-edge. There are also new 
variables z„ for each u € V(G). Likewise, the vector of variables z,, shall be 
denoted by z. The reason we add these variables shall be explained later. 
Now, for every L, let us define a polynomial 


k-1 L 
R(X, y, z) = 5 5 Zvi II Lyivisr Il Y fw (i) Lli) 
t= i=1 


walk W =11,..., vk £: [L]>[L] 
W is L-admissible 2 is bijective 
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where fy(i) is the i-th labelled object (Vj-vertex or V2V2-edge) in walk W, 
in the order of visiting vertices /edges by the walk (see Fig. (10.3). Again, the 
term zv, em, Torii MŁ, Y fw (i),€(¢) Corresponding to one labelled walk will 
be denoted by mon(W, £). 

The final polynomial for the non-bipartite case is defined as follows: 


ud 


R= 5 Ry. 


L=k/2 


Note that we use [2k] labels only. However, can we really ignore all the walks 
that are not (< [3k])-admissible? Note that if there is a k-path in G, but all 
the k-paths are not (< [#k])-admissible, the polynomial R can be the zero 
polynomial, so the analogue of Corollary does not hold! Nevertheless, 
recall that whether a path is L-admissible or not depends on the chosen 
random bipartition of V(G), and if there is a k-path we are allowed to make 
a mistake with a small probability. Fix a k-path P = v,,...,v,. For the 
random choice of bipartition (V1, V2), consider a random variable 


L(P) = |{i : vi € Vi}| + |{i : viviga € V2}. 
By the linearity of expectation we infer 


k k-11 3k-1 


E(P) =5 +4 ; 





It follows that sometimes P should be (< [#k])-admissible. We can bound 
it more precisely using Markov’s inequality, which states that Pr(X >a) < 
E(X)/a holds for every nonnegative random variable X and positive a. 


(3k —1)/4 _ 
[Šk] +1 





Pr(P is not L-admissible for all L < [3k]) < Q(k-*). 


Moreover, it is easy to check that for any bijection £: [L(P)] > [L(P)], from 
mon(P, £) we can uniquely recover P and £. Note that here we need the 
new Z% factor, because without it the monomial mon(P, £) corresponds to 
two labelled walks: one which traverses P from vı to vg and another which 
goes in the other direction. Hence the analogue of the ‘=’ implication of 


Corollary |10.22]is as follows. 


Corollary 10.27. Ifthe input graph G contains a k-path, then the polynomial 
R is not identically zero with probability Q(k—'). 


Although the success probability may seem to be small, observe that, as 
our algorithm has one-sided error, it is enough to repeat it klogn times 
and the failure probability drops to (1 — Q(k7!))* Lsn = (e~ 2k") )Flosn — 
e7 Logn) Z 1/n?®. 
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Now we proceed to show that if the polynomial R is nonzero, then G 
contains a k-path. In other words, we show that non-path walks cancel out. 
Consider a non-path labelled L-admissible walk (W, £), where W = v1,..., Uk 
and L < [3k]. The analysis of Case 1 from the bipartite case is still valid. 
In Case 2 (i.e., whenever v; = vj then i = j or v; € V2), we choose the 
lexicographically first pair (i, j) such that v; = vj (hence v; € V2). We pair 
up the labeled walk (W, £) with a new labelled walk (W’, £’), which is obtained 
from (W, £) by reversing the closed walk v;,v;41,...,v;, and also reversing 
the order of labels on this closed walk. The same argumentation as in the 
bipartite case shows that W’ is L-admissible, and that (W’, 7) is assigned 
back to (W, £). 

By the definition of (W’, ¢’), we have that {(fw (i), @(4)) : i=1,..., L} = 
{(fw (i), l (i)) : i = 1,..., L}. It follows that mon(W, £) = mon(W’, 2’). 
Let us show that (W', 7) # (W,4). Assume W = W’. This means that 
Vi, Uil; -< -, Uj = Vi, Vj-1,---,V;i- In other words, if we put t = (i + j)/2, then 
our closed walk follows a walk from v; to v; and back: 


Ui, Vi+1, aa Uj = Ui, Vi4+1; essy Ut_l1, Ut, Ut-15-+- > Vi+1, Vi. 


Note that vj41,..., 0-1 € Və for otherwise Case 1 applies. Moreover, v € Va, 
for otherwise the closed walk v,_ 1 v;0441 makes W not admissible, a contradic- 
tion. Hence, the closed walk being reversed contains at least two V2V2-edges. 
We reverse the order of labels assigned to these edges, so ¢’ 4 ¢. Thus we 
have shown that (W, V’) 4 (W, £). Moreover, if we follow the same way of 
assignment we get (W, £) again, so just as in the previous sections we parti- 
tioned all the non-path labelled walks into pairs with the same monomials. 
This means that the monomials cancel out as required. 

Now it remains to show that for every i = k/2,..., [3k/4], the polynomial 
R; can be evaluated in time 2'n°™). As in the bipartite case, using inclusion- 
exclusion this reduces to a standard exercise in dynamic programming. To 
enumerate the walks W with L(W) = i we additionally keep track of the 
number of Vj-vertices and V2V2 edges. The details of this simple adjustment 
are left to the reader. This proves the following. 


Theorem 10.28. There is a one-sided error Monte Carlo algorithm with 
false negatives which solves the LONGEST PATH problem for undirected graphs 
in time 23*/4n°) = 1.69'nPC and polynomial space. 


Let us finally make a note on the derandomization of the algorithms for 
LONGEST PATH discussed in this chapter. In Section [5.6] we explain how ran- 
domized algorithms based on color coding can be derandomized by a small 
additive cost O(log? k) in the exponent. It is not clear at all if a similar result 
can be claimed for algorithms from this chapter based on monomial testing. 
We will explain in Chapter [L2] (Theorem [12.37) how techniques based on rep- 
resentative sets of matroids can be used to obtain a deterministic 2.619*n°- 
time algorithm for LONGEST PATH. 
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Exercises 


10.1 (&). Derive Theorem from Theorem using De Morgan’s laws. 


10.2 (Z). Let A be an adjacency matrix of a directed graph G over the vertex set [n], 
i.e., aij = [(i, j) € E(G)] for i,j = 1,...,n. For a positive integer k, let M = AF. Show 
that for every i, j =1,...,n, the number of length k walks from i to j is equal to mij. 


10.3 (Z). Prove Observation 


10.4 (&). Show that the number of length £ branching walks in an n-vertex graph can 
be represented using O(£ log n) bits. 


10.5. Knowing that counting k-vertex paths in FPT time implies that FPT = W/[1] by 
a result of Flum and Grohe [188], show that it is impossible to count solutions for the 
STEINER TREE problem, unless FPT = W[1]. 


10.6. The algorithms for HAMILTONIAN CYCLE and STEINER TREE tree presented in this 
chapter suffer a multiplicative O(nlog° n) overhead in the running time because of 
performing arithmetic operations on O(n log n)-bit numbers. Show that one can shave this 
overhead down to O(log n) at the cost of getting a one-sided error Monte Carlo algorithm 
with false negatives occurring with probability O(n™ 1). 


10.7. In the TSP problem, we are given a complete graph with a weight function w : 
V? — {0,...,W} and the goal is to find a Hamiltonian cycle H of the smallest weight 
(ie, Xavera W(u,v)). Describe a 2°n OQ) . W-time, Wn©)-space algorithm for the 
TSP problem. 


10.8 (Z). In the List COLORING problem, we are given a graph G and for each vertex 
v € V(G) there is a set (also called a list) of admissible colors L(v) C N. The goal is to 
verify whether it is possible to find a proper vertex coloring c: V(G) > N of G such that 
for every vertex v we have c(v) € L(v). In other words, L(v) is the set of colors allowed for 
Vv. 

Show a 2°n°()-time algorithm for List COLORING. 


10.9. Show an algorithm which computes the number of perfect matchings in a given n- 
vertex bipartite graph in 2"/2n OQ) time and polynomial space. (The solution is called 
Ryser’s Formula.) 


10.10 (&). Prove Proposition |10.10 
10.11 (Æ). Show that (ZU {+00}, min, +) is a semiring. 


10.12 (Æ). Modify the proof of Theorem]10.17]to get an algorithm for cover product in 
min-sum semirings. 


10.13. The packing product of two functions f,g: 2V — Z is a function (f *pg):2” > Z 
such that for every Y C V, 


(Fer g(Y)= D> SAIB). 
A,BCY 
ANB=0 


Show that all the 2!”! values of f *p g can be computed in 2”n°() time, where n = |V]. 
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10.14. Let p be a polynomial in n variables over the finite field GF(q), such that the degree 
of p in each variable (i.e., the maximum exponent of a variable) is strictly less than q. Show 
that if p evaluates to zero for every argument (i.e., p corresponds to the zero function), 
then all the coefficients of p are zero (p is the identically zero polynomial). 


10.15. Let f be a function f : GF(q)" —> GF(q) that corresponds to a polynomial (possibly 
of large degree). Show that there is exactly one polynomial p that evaluates to f(1,...,@n) 
for every tuple (x1,..., £n) E GF(q)” such that the degree of p in each variable (i.e., the 
maximum exponent of a variable) is strictly less than q. 


10.16. In the 2*n©°“)-time algorithm for Loncest PATH, we introduced labels so that 
the non-path walks pair up and hence cancel out. This resulted in O(2*) overhead in the 
running time. In the 24/2n°()-time algorithm for bipartite graphs we used a different 
argument for pairing-up non-path walks, by reversing closed subwalks. Does it mean that 
we can skip the labels then? It seems we cannot, because then we solve LONGEST PATH in 
polynomial time! Explain what goes wrong. 


10.17. Given a graph G and a coloring c : V(G) = [k], verify in O(2" poly(|V|)) time and 
polynomial space whether G contains a k-path with vertices colored with all k colors. 


10.18. In the WEIGHTED LONGEST PATH problem, we are given a directed graph G with a 
weight function w : E(G) > {0,...,W} and the goal is to find a k-path P of the smallest 
weight (i.e, X uveg(p) w(u v)). Describe a 2k. W-n°™-time polynomial space Monte 
Carlo algorithm for this problem. 


10.19. Describe a Monte Carlo 23*n©()-time polynomial-space algorithm for the TRIAN- 
GLE PackING problem: given a graph G and a number k € N, decide whether G contains 
k disjoint triangles (subgraphs isomorphic to K3). 


10.20. In the COLORFUL GRAPH MotiF problem we are given a graph G, a coloring 
c: V(G) +N, and a finite set of colors M C N. Denote k = |M]. The goal is to decide 
whether there is a subset S C V(G) of size k such that G[S] is connected, and c(S) = M. 


(a) Describe a c*n()-time polynomial-space algorithm for a constant c, ideally a 2*nO()- 
time Monte Carlo algorithm. 
(b) The same, but in the case when M is a multiset (this version is called GRAPH Mortir)? 


(2) 
10.21. Modify the dynamic programming from Lemma |10.24| to make it evaluate the 
polynomials Qx from Section [10.4.2 


Hints 


Show that by counting appropriate Steiner trees one can compute the number of 
k-vertex paths connecting a fixed pair of vertices. 


Modify slightly the 2”n©()-time algorithm for the regular vertex coloring. 
Inclusion—-exclusion. 


10.14| Use induction on the number of variables n, as in the proof of the Schwartz-Zippel 
lemma. 


10.15] To show that there is at least one such polynomial, consider any polynomial p cor- 
responding to f and note that if p has a monomial that contains «7 for some indeterminate 
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xi, then by replacing x? by x; 7 we get another polynomial corresponding to f. To show 
that there is at most one such polynomial, assume the contrary and use Exercise]10.14 


10.17} Modify the 2*n©@)-time algorithm for Loncrst PATH. 


There are at least two approaches. Perhaps the simplest approach is to consider 
every possible weight i = 0,...,kW and check if there is a k-path of weight i by focusing 
on walks of weight i only. Another approach could be to introduce a new variable z which 
will not be evaluated and the degree of z in a monomial corresponds to the weight of the 
relevant walk. 


10.19] Use a similar approach as in the 2*n©()-time algorithm for k-path from this 
chapter. 


as in Exercise |10.17| using branching walks. To get 2knO) running time in the latter 
approach observe that one can use M as the set of labels. The extension to the multiset 
variant is just a matter of technical details in the latter approach. 


10.20) Point i can be done using two approaches: reduce to STEINER TREE or proceed 


10.21] Keep track not only of the first vertex of the walk, but also of the second one. 


Bibliographic notes 


The principle of inclusion—exclusion is one of the most basic tools in combinatorics. The 
books of Aigner [7], Ryser [408], and Stanley contain detailed discussions of the 
method. The application of the inclusion—exclusion principle for Hamiltonicity was redis- 
covered several times. The algorithm running in time 2"n°“) and using polynomial space 
for the problem was given by Kohn, Gottlieb, and Kohn in 1969 [302]. It was rediscovered 
in 1982 by Karp [282], and in 1993 by Bax [27] [28]. For an overview of the area of exact 
exponential algorithms we refer the reader to surveys of Woeginger and the book 
of Fomin and Kratsch [197]. 

The notion on branching walk and the poly-space O(2!*ln©))-time algorithm for 
the unweighted STEINER TREE problem is due to Nederlof [374]. This algorithm can be 
extended to the weighted case; as in Exercise [10.7] for integer edge weights bounded by C, 
this comes at the cost of an O(C) factor in the time and space complexity. Nederlof and 
Lokshtanov show that the space can be improved to n°) log C, hence polynomial in 
the input size, as opposed to n°) C, called pseudo-polynomial (the paper contains similar 
consequences for other problems, like SuBsET Sum or KNapsAck). However the running 
times of polynomial-space algorithms in is O(2!* lnOCMC), which is not FPT. The 
first algorithms for graph coloring with running time 2"n°@) were given independently 
by Björklund and Husfeldt and Koivisto (see the joint journal article [40]). The Möbius 
inversion formula presented in this chapter can be seen as a generalization of the well-known 
Möbius inversion formula from number theory. The version for the subset lattice is usually 
attributed to Weisner and Hall [255]. The fast zeta transform algorithm dates back to 
Yates but it was first used in the context of FPT algorithms by Bjérklund, Husfeldt, 
Kaski and Koivisto [37], who presented also the algorithms for subset convolution and 
cover product included in this chapter and also for packing product (Exercise|10.13). The 
odd-negation transform is from lecture notes of Kaski [284]. The fast subset convolution 
algorithm from is faster by a factor of O(n) than the one presented in this chapter. 
The Ryser formula (Exercise[10.9) is due to Ryser [408]. 

Koutis [305| was the first to observe that using algebraic methods one can enumerate all 
k-walks and get the non-path walks cancelled. His approach was to directly check whether 
a polynomial contains a multilinear monomial using group algebras. Koutis obtained a 
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23k/27°()-time algorithm which was next improved to 2*n°@) by Williams with 
similar methods. The idea of using bijective labellings to get the non-path walks can- 
celled was discovered by Bjérklund as a tool for his 1.66"n°()-time algorithm for 
the Hamiltonian cycle problem in undirected graphs. It was next extended by Björklund, 
Husfeldt, Kaski and Koivisto to a 1.66*n°()-time algorithm for k-path in undirected 
graphs and the algorithms from Section are based on this work. The slight speed-up 
from 23*/4n°2) = 1.682'nO) to 1.66*nOW) is obtained by using [(# — e)k] labels for 
some e€ > 0. This comes at the price of getting extremely low probability that the solu- 
tion k-path is not filtered out, so the whole algorithm needs to be repeated a(€)” number 
of times, for some function a. For the optimal choice of e the overall time complexity is 
1.66*n©@), Björklund attributes this speed-up to Williams. Koutis’ paper con- 
tains also the first 23*n©()-time algorithm for triangle packing (Exercise moe the 
currently best result is the 1.493°*n©()-time algorithm of Björklund, Husfeldt, Kaski and 
Koivisto [89]. The 25n ()-time algorithm for the multiset version of the GRAPH MOTIF 
problem (Exercise[10.20} is due to Björklund, Kaski and Kowalik [£I]. They also show that 
the existence of a (2 — €)*n©()-time algorithm for the GrapH Morir problem implies a 
(2 — e')FnO -time algorithm for Set Cover. 

For the construction and properties of finite fields, see the textbook of Lidl and Nieder- 
reiter [321]. For a fixed prime p, multiplication in a finite field GF (p”) can be performed 
in time O(n lognloglogn) by applying the Schönhage-Strassen algorithm for integer 
multiplication (see [226]). 

The Schwartz-Zippel lemma was found independently by DeMillo and Lipton [136], 
Zippel and Schwartz (their formulations were not completely equivalent but 
each of them would suffice for our needs in this chapter). We use the name Schwartz- 
Zippel lemma because it is well established in the literature. 

Kabanets and Impagliazzo show that the existence of a polynomial-time algo- 
rithm for POLYNOMIAL IDENTITY TESTING would imply arithmetic circuit lower bounds 
for NEXP, which would be a major breakthrough in computational complexity. 

In all algorithms from Section [10.4] we could have used the isolation lemma (see Chap- 
ter|11) instead of the Schwartz-Zippel lemma, at a price of significantly higher polynomial 
factors in the running time. The essence of the algorithms (cancelling of objects corre- 
sponding to non-simple walks) stays the same. However, it is very enlightening to see the 
other approach and be able to translate the algorithms from one approach to another. We 
refer the reader to the paper of Fomin and Kaski [196] with an accessible exposition of a 
2knO)-time k-path algorithm described in the framework of the isolation lemma. 


Chapter 11 
Improving dynamic programming on 
tree decompositions 


In this chapter we consider more sophisticated 
dynamic-programming routines for graphs of bounded 
treewidth, compared with the classic ones described in 
Chapter [7 We give examples of algorithms based on 
subset convolution, Cut & Count and rank-based ap- 
proaches. 





In Chapter |7| we have seen a few examples of straightforward dynamic 
programming for graphs of bounded treewidth. The idea behind the design 
of those algorithms was to use small vertex separators, namely the bags of the 
tree decomposition, to extract succinct information about partial solutions 
that is enough to carry on the construction of a final solution. In the case 
of DOMINATING SET (see Section for each vertex it was enough to 
store one of its three possible states, which led to O(3") states per node of 
the tree decomposition. However the running time needed to perform the 
computation in a straightforward manner in the join nodes was O(4*). In 
Section [17.1] we consider the ## PERFECT MATCHINGS problem and again the 
DOMINATING SET problem, where the standard approach gives respectively 
O(3*) and O(2*) states per node of the tree decomposition of width k, but 
it is not clear how to obtain such a running time in join nodes. We show how 
to obtain such running times by invoking the fast subset convolution from 
Chapter [10] 

Next, in Section 11.2] we deal with a class of problems with some kind of 
global requirement in the problem definition, informally called connectivity 
problems. For all of those problems the standard dynamic-programming ap- 
proach gives kO™) states per node, whereas we would like to have only single 
exponential dependence on treewidth. 
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First, we show a randomized approach called Cut & Count, which by 
reduction to counting modulo 2, together with cut-based cancellation argu- 
ments, gives us Monte Carlo algorithms with 2°“) dependence on treewidth 
(for the definition of Monte Carlo algorithms see Chapter [5). 

In Section [11.2.2] we show how, by using tools from linear algebra, one 
can obtain deterministic algorithms with similar running times (though with 
a worse base of the exponential function). The main idea is to treat states 
in the dynamic programming computation as vectors in a suitably defined 
linear space. Even though seemingly this translation makes things more com- 
plicated, it shows that linear dependence of vectors enables us to discard some 
states and bound the number of considered states by 20%), 


11.1 Applying fast subset convolution 


In this section we apply fast algorithms for subset convolution developed 
in Chapter to obtain faster dynamic programs on graphs of bounded 
treewidth. 


11.1.1 Counting perfect matchings 


Let us recall that a matching M of a graph G is perfect if all vertices of G 
are matched by M, i.e., every vertex of G is an endpoint of some edge from 
M. Consider the following ##PERFECT MATCHINGS problem: given a graph 
G, the task is to find the number of perfect matchings in G. This problem 
is #P-complete and thus it is unlikely to be solvable in polynomial time. 
We will show an efficient FPT algorithm (parameterized by treewidth) which 
applies the fast subset convolution described in Section 

We assume that we are given a nice tree decomposition T = (T, {Xi }revr)) 
of graph G of width k. For each node t € V(T) and set S C X+, we com- 
pute c[{t, S], which is the number of matchings in the graph G[V;], the graph 
induced by the vertices contained in X, and the bags below X;, such that: 


e all vertices in V; \ X; are matched, 
all vertices in S are matched to vertices in V; \ X+, and 
all vertices in X; \ S are unmatched. 


Note that if r is a root of T then cfr, ø] is the number of perfect matchings 
in G. Let us describe how the nodes of the tree are processed. 


Leaf node. Since in a nice tree decomposition each leaf corresponds to an 
empty bag, the number of matchings that use no vertex equals 1, i.e., it is 
the empty matching. Therefore we set c[t,@] = 1 for every leaf t. 
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Introduce node. Let t’ be the child of an introduce node t and X; = XyU{v} 
for some v ¢ Xy. Observe that the vertex v cannot be matched yet, as it 
is not adjacent to vertices of V; \ X+. Hence, for every S C X, we have 
c{t, S] = ct’, S] if v € S and clt, S] = 0 if v € S. 


Forget node. Let t’ be the child of a forget node t, that is, for some w € Xy 
we have X; = Xv \ {w}. Then, 


ct, S] =c, SU{w}]+ XO e, S\ {vo}, 


vESNNG(w) 


where the first summand corresponds to the choice of matching w to a vertex 
of Vy \ Xv, while the second summand (i.e., the sum over v € SM No(w)) 
corresponds to matching w to a vertex of S. 

Note that processing the leaf nodes, introduce nodes and forget nodes takes 
2% . kO) time only. The join nodes are slightly more involved. 


Join node. Let tı and tz be the children of a join node t. Thus X; = X, = 
Xt,- Note that if a vertex of X; is matched to V;\ X; then it is matched either 
to Va \Xz,, or to Va \ Xi, but not to both since these sets are disjoint. Thus, 


clt, S] = XL citi, A] - elt2, 9 \ Al. 
ACS 


The naive computation of the values of c[t, S] for all S C X, using the above 
formula takes time proportional to Dscx, ais! — 3lX+l < 3'+1_ However, we 
can treat the entries c|t, S] for a fixed node t as a function f : 2*' — N such 
that for every S C X; we have f(9) = c[t, S]. Similarly, define g, h : 2** > N 
such that for every S C X; we have g(S) = c[t1, S] and A(S) = c[t2, S]. Then, 


ct, S] = f(3) = XL g(A)-h(S\ A). 


ACS 


In other words, computing c[t, S] for all subsets S is just computing the subset 
convolution of g and h, which can be done in time 2!*#l . | X,|0® = 24 .kOM 
by Theorem {10.15} Let us conclude with the following theorem. 


Theorem 11.1. Let G be an n-vertex graph given together with its tree de- 
composition of width k. Then one can find the number of perfect matchings 
in G in time 2% -kO n, 


11.1.2 DOMINATING SET 


Recall the 4" -kO . n-time algorithm for the DOMINATING SET problem 
parameterized by treewidth k from Section It turns out that it can 
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be accelerated to time 3° -kO™ . n using the fast subset convolution from 
Section[10.3] Note that processing introduce nodes and forget nodes is already 
fast enough. We will show here that for a join node t the values of c[t, f] for 
all the 3° functions f : X, > {0,0,1} can be computed in time 3*£0(. 
Recall that 


elt, J] = min felti, fa] + lta, fa] — I0] 


where tı and tz are the children of t (recall that X;, = Xa = X; in the 
nice tree decomposition) and the minimum is taken over all colorings fi, fo : 
X+ — {0,0,1} consistent with f, that is, 


e f(v) =1if and only if fi(v) = fe(v) = 1, . 
e f(v) =O if and only if (fi(v), fo(v)) € {(0, 0), (0, 0)}, 


e f(v) = 0 if and only if f,(v) = fo(v) = 0. 
Note that the above three conditions are equivalent to 


© f M=f (=f), 
e f= (0) = fT MU FO), 
e FON fz 0) =A. 

One could think that the condition f—!(0) = f71(0)N fz1(0) is missing 
here; however, note that it is implied by the first two. 

The above considerations suggest that if we fix f~1(1), then what we want 
to compute resembles the subset convolution. Let us follow this idea. Fix a 
set R C X;. Let Fr denote the set of all functions f : X, > {0,0,1} such 
that f~1(1) = R. We want to compute the values of c[t, f] for all f € Fr. 

Note that every function f € Fp can be represented by a set S C X;\ R, 
namely the preimage of 0. Hence we can define the coloring represented by 
S as 

1 ifzeR, 
gs(z)=40 ifxeS, 
0 iffe X,\(RUS). 


Then for every f € Fr we have 


ct, fJ= min, (elf, ga] + elt2, gal) — IF 0]. 


AUB=f~—1(0) 
ANB=0 


For v € {t1,t2}, we define functions c, : 2*+;\R — N such that for every 
S C X; \ R we have c,(S) = civ, gs]. We see that for every S C X; \ R, 
cft, gs] = (ct, * Ctz)(S) — |R], 


where the subset convolution is over the min-sum semiring (see Section|10.3.2). 
Note that the table c stores integers bounded by the number of vertices in 
the graph. Hence, by Theorem}10.17| we can compute cft, f] for every f E€ Fr 
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in 2IX:\Fl . n°) time. Since Y pcx, 2'**\Fl = 3l%+! < 3+1, the total time 
spent for all subsets R C X; is 3% -nOM, 


Theorem 11.2. Let G be an n-vertex graph given together with its tree de- 
composition of width k. Then DOMINATING SET is solvable in time 3° nO, 


In Exercise we ask the reader to improve the dependence on n in 
Theorem from polynomial to linear. 


11.2 Connectivity problems 


In Section [7.3.3] we gave a standard dynamic-programming algorithm solving 
STEINER TREE in time k?n on graphs of treewidth k. The main reason this 
approach suffers from the kO‘*) dependence on treewidth is that we have to 
keep track of partitions into connected components. In this section we show 
two methods that allow us to bypass the need to store all the possible parti- 
tions. In Section [11-2-1]we describe a randomized approach which reduces the 
decision variant of the problem to counting modulo 2, which as in Chapter [10] 
appears to be a fruitful approach. Next, in Section /F11.2.2] we show a greedy 
method based on tools from linear algebra that allow us to store only a single 
exponential number of partitions. 


11.2.1 Cut & Count 


Assume that we are given a graph G, together with a set of terminals K C 
V(G), a tree decomposition 7 of G, and an integer @. We are to decide whether 
there exists a Steiner tree of size at most @. In other words we are to check 
whether there is a connected subgraph of G with at most £ edges containing all 
the terminals K. Our strategy is first to solve the counting modulo 2 variant 
of the problem, i.e., find the parity of the number of connected subgraphs of 
G with at most £ edges spanning all the vertices of K. A priori this does not 
seem to be easier than solving the decision variant, but as we are about to 
see working modulo 2 allows cancellation tricks as in the case of finite fields 
of characteristic 2 in Section [10.4 

Let R be the set of all subgraphs of G with at most £ edges containing 
K, and let S be the set all solutions, i.e., all subgraphs from R which are 
connected. A formal definition follows. 


R={HCG: |E(H)| <¢,K CV(A)} (11.1) 
S={HER : H is connected} (11.2) 
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By straightforward methods we can not only find the parity of |R| but also 
the value of |R] itself in time 2*n©) (see Exercise|11.4). However this is not 
exactly what we are after. We would like to use some kind of overcounting 
argument, so that each element of R \ S would be counted an even number 
of times, while each element of S would be counted an odd number of times. 
For this reason we define a family of cuts, which are just partitions of the set 
of vertices V(H) into two sets V+, V?. We say that a subgraph H of G and 
a cut (Vt, V?) are consistent if no edge of H has its two endpoints on two 


different sides of the cut, i.e., when E(H) C (a) U an where (>) denotes 
all size 2 subsets of X (edges with both endpoints in X). 

For a subgraph H of G let us ask ourselves a question, how many cuts 
is the graph H consistent with? Since no edge of H can go across the cut, 
each of its connected components has to be either entirely in Vt, or entirely 
in V2. Other than that we are free to choose the side of the cut for every 
connected component of H independently. This means that a subgraph H 
with cconnected components is consistent with 2° cuts. This is almost perfect, 
as all subgraphs with more than one connected component are consistent 
with an even number of cuts, but unfortunately a connected subgraph H is 
consistent with two subgraphs, instead of one subgraph as we would like it 
to be. For this reason we select some arbitrary terminal vertex vı € K, and 
fix it permanently into the V! side of the cut. For a subset W C V(G) and 
vı € K define 


cuts(W, v1) := {(V1, V?) : V'UV? =W AVIAV? =bA^v €V'}. 


Now we show that with the updated definition of cuts instead of calculating 
the parity of |S| directly, we can count pairs consisting of a subgraph from 
R and a cut consistent with it. 


C = {(H,(V',V”)) E R x cuts(V (H), v1) : H is consistent with (V1, V?)}. 
(11.3) 


Lemma 11.3. The parity of |C] is the same as the parity of |S]. 


Proof. Consider a graph H from R with c connected components. Each con- 
nected component has to be contained either in V! or in V?, as otherwise 
there would be an edge crossing the cut. However, the connected component 
of H containing vı has to be contained in the Vt-side of the cut. Conse- 
quently H is consistent with 2°71 cuts, which is an odd number for H € S 
and an even number for H € R \ S. 














Therefore it remains to show how to compute the parity of |C| by standard 
methods with only 2°(*) dependence on treewidth. The key observation is 
that we can do that by storing local information only, that is, unlike in 
Section we do not need partitions in our dynamic programming state 
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anymore. For each node t € V (T), an integer i < £, and a function f : X; > 
{0,1,2} we compute cft, f, i], which is the number of pairs (H, (V1, V2)), such 
that: 


e H is a subgraph of (V;, E+) with exactly i edges, and H contains all the 
vertices of K N V;, 

e (V!,V7?) is a cut consistent with H, i.e., V! U V? = V(H), VIA V? = b, 
U1 g V?, 

e the set of vertices of H intersecting the bag is exactly f—'(1) U f~'(2), 
ie, V(H) AX; = f-4(1) U f-1(2), 

e the function f describes how vertices of X;\ f~1(0) are assigned to Vt 
and V®, that is, VÍ N V(H) NA X: = ft (j) for j € {1, 2}. 


We stop the description of the dynamic-programming routine here and 
leave the details to the reader (see Exercise|11.5), as no new ideas are needed 
from this point. The number of states we have defined is 3°n°™), and this is 
also the running time of our algorithm computing |C|, which by Lemmaf11.3jis 
sufficient to find the parity of |S|. Consequently we established an algorithm 
for the counting modulo 2 variant of STEINER TREE with 2°“) dependence 
on treewidth, which was our first goal. 

It remains to show why solving the parity version of the problem is help- 
ful when facing the decision version. In Chapter [10] we have already seen an 
example of this, when we were working with polynomials in fields of charac- 
teristic 2. Here, we use an alternative to polynomial identity testing, which 
is called the isolation lemma. 

Consider the following experiment: we choose a large integer N and as- 
sign to every edge e € E(G) a weight w(e) € {1,2,...,N} uniformly and 
independently at random. For a subgraph H € R, we define its weight as 
w(H) = } cmc) W(e)- For an integer w, let Ry ={H ER : w(H) =w} 
and similarly define S,, and Cw. It is a standard task to adjust the aforemen- 
tioned dynamic-programming algorithm to compute |C,,| for all reasonable 
weights w, at the cost of an additional polynomial factor in n and N in the 
running time bound. Intuitively, large N should ensure that in a yes-instance 
the random choice of weights sufficiently shuffles the solutions among the 
sets Sw such that for at least one weight w the set Sw is of odd size, and 
we can detect it by computing |C,,|. The isolation lemma is an even stronger 
statement: it asserts that it suffices to take N = 2(|E(G)|) to obtain, with 
constant probability, at least one set S,, of size exactly 1. 

Let us now proceed with a formal statement and proof of the isolation 
lemma. 


Definition 11.4. A function w : U — Z isolates a set family F C 2" if there 
is a unique S’ € F with w(S’) = mingser w(S). 


In the above definition w(X) denotes 7 ex w(u). 


Lemma 11.5 (Isolation lemma). Let F C 2” be a set family over a uni- 
verse U with |F| > 0. For each u € U, choose a weight w(u) € {1,2,...,N} 
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uniformly and independently at random. Then 


we 


P isolat > 
r(w isolates F) > N 


Proof. Let u € U be an arbitrary element of the universe U, which appears 
in at least one set of F and does not appear in at least one set of F (note 
that such an element does not need to exist). For such u we define 


w(S)— min w(S\ {u}. (11.4) 


a(u)= min 
ugs SéF ues 


SEF 


Note that a(w) is independent of the weight of w(u), as w(u) it not used at 
all at the right-hand side of (11.4). Consequently 


Pr(a(u) = w(u)) < 1/N, (11.5) 


since we can imagine that the weight w(u) is sampled in the end, when all 
the other weights, and a(u) in particular, are already known. 

Assume that there are two sets A, B € F of minimum weight with respect. 
to w and let x be an arbitrary element of A \ B (note that A\ B #0 as all 
the weights are positive). We have 


a(z)= min w(S)-— min w(S\ {z}) 


SEF ,x¢S SEF ,xES 
= w(B) — (w(A) — w(2)) 
= w(x). 


Hence if w does not isolate F then there is an element x and sets A, B € F 
such that « € A \ B and a(x) = w(x). However, by (11.5) and the union 
bound, this happens with probability at most |U|/N. 














The reason Lemma [IT.5]is useful for us is that it gives a reduction from the 
decision version of the problem to the parity version as follows. Our universe is 
U = E(G), and we put F = {E(H) : H € S}. We sample an integral weight 
w(e) from {1,...,2|U|} for each element of the universe e € U, i.e., for each 
edge. For each w between 1 and 2|E|@ we compute the parity of the number 
of pairs in C, where the subgraph H is of weight exactly w with respect to w, 
which can be easily handled by the dynamic-programming routine by adding 
a polynomial overhead in the running time. If for some w the parity is odd, 
we answer YES, otherwise we answer NO. 

Even though there might be exponentially many solutions, that is elements 
of S, with probability at least 1/2 for some weight w between 1 and 2|E|é 
there is exactly one solution from S of weight w with respect to w. As a 
corollary we obtain a Monte Carlo algorithm where the probability of error 
is at most 1/2. 
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Theorem 11.6. There is a one-sided error Monte Carlo algorithm with false 
negatives which solves the STEINER TREE problem in time 3*n°™ given a 
tree decomposition of width k. 


The described technique may be applied to several problems, with a global 
connectivity constraint, which we call connectivity problems. A list of such 
problems will be given in Theorem|11.13} The general scheme of the approach 
is as follows. 


The Cut & Count technique may be applied to problems involving a 
connectivity requirement. Let S C 2” be a set of solutions (usually 
the universe U is the set of vertices or edges/arcs of the input graph); 
we aim to decide whether it is empty. Conceptually, Cut & Count can 
naturally be split into two parts: 


e The Cut part: Relax the connectivity requirement by considering 
the set R D S of possibly disconnected candidate solutions. Further- 
more, consider the set C of pairs (X,C) where X € R and C is a 
consistent cut of X. 

e The Count part: Isolate a single solution by sampling weights of 
all elements in U. Compute |C| mod 2 using a sub-procedure. Non- 
connected candidate solutions X € R \ S cancel since they are con- 
sistent with an even number of cuts. The only connected candidate 
x € S remains (if it exists). 


Finally we note that not only is the general scheme of the presented tech- 
nique rather simple, but it also gives small constants in the base of the ex- 
ponent (e.g., see Exercises (11.6]and [11.7). As a matter of fact, often the con- 
stants in the base of the exponent given by the Cut & Count approach are 
optimal under the Strong Exponential Time Hypothesis (see Theorem|14.41). 


*11.2.2 Deterministic algorithms by Gaussian 
elimination 


Even though the Cut & Count technique allowed us to break the kO“) barrier, 
it is natural to ask whether one can achieve single exponential dependence on 
treewidth by a deterministic algorithm. Additionally the approach presented 
in this section allows for smaller polynomial factors in the running time, and 
at the same time is able to handle arbitrary real weights in optimization prob- 
lems, which the Cut & Count technique inherently cannot capture. However, 
the drawback of the new approach is a worse base of the exponential function 
in the dependence on treewidth. 
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Recall the standard dynamic-programming approach for the STEINER 
TREE problem from Section In this section we slightly change the 
definition of a state as follows. For a bag X;, a set X C X; (a set of vertices 
touched by a Steiner tree), and an integer i, we define c|t, X, i] as the family 
of all partitions P = {P}, P2,...,P,} of X for which there exists a forest F 
in (V;, E+) such that 


e |E(F)| =i, i.e., F has exactly i edges, 

e F has exactly q connected components Cj,...,Cqg and for each s € 
{1,...,¢}, Ps C V(C,). Thus the partition P corresponds to connected 
components of F; 

e X:NV(F) =X, i.e., vertices in X, \ X are untouched by F; 

e every terminal vertex from K N V; is in V(F). 


Note that there are two significant differences between the dynamic pro- 
gramming of this section and the one from Section First, the size of 
the Steiner tree under construction is explicitly stored in the state under the 
variable i; earlier we just stored the minimum size as a table entry. Second, 
instead of storing a partition as a part of the state description, we store it 
in clt, X, 2], as c[t, X, i] is now a set family, not a number. It should be clear 
that the dynamic-programming routine from Section [7.3.3] can be adjusted 
to handle the new definition of a state in our table[] 

The total space used in this algorithm is kOn), and the reason for 
this is the number of possible partitions of the set X, i.e., the set of vertices 
of the current, bag touched by the Steiner tree under construction. Note that 
the number of subsets X C X; is at most 2*+!, which is single exponential, 
hence it is enough if we could somehow consider not all the partitions but 
only 2°(*) of them. 

Fix a subset X C X; and define U = X. Let I(U) be the set of all 
partitions of U. A single set within a partition is called a block. For two 
partitions P,Q € I(U) denote by P U Q the most refined partition R of 
U, such that every block of both P and Q is contained in a single block of 
R (in the partition lattice language P U Q is the join of P and Q). Less 
formally, if we imagine two arbitrary graphs Gp, Gg on the same vertex set 
U, where the family of connected components of Gp is P and the family 
of connected components of Gg is Q, then P U Q corresponds to the set 
of connected components in the union of Gp and Gg (in particular if the 
union of Gp and Gg is connected, then P U Q = {U}). For example in a 
universe of four elements {a, b, c, d}, and partitions P = {{a}, {b}, {c, d}} and 
Q = {{a, b}, {c}, {d}}, we have P U Q = {{a, b}, {c, d}}. 

Consider a very large matrix M. Each row and column of M corresponds 
to a partition of U. An entry in row P and column Q is equal to 1 if PU Q 
is the single block partition {U}, and zero otherwise (see Fig. Ei). 





1 An observant reader probably noticed that with this new definition the total number of 
states is quadratic in n. We ignore this loss at the moment as we will oust this additional 
O(n) factor later. 
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Intuitively each row of the matrix M is a partition corresponding to 
some partial solution in the graph (V;, Æ+), that is, the part of the graph 
which we have seen so far in the decomposition. At the same time each 
column of M corresponds to a potential partition of X into connected 
components by the part of the solution contained in E(G) \ Æ. A single 
value in the matrix tells us whether two such parts of a solution, the one 
which we have already constructed and the one we are still to construct, 
are compatible, i.e., if when joined they give a connected subgraph. 


Definition 11.7. Define matrix M € {0,1}70*"@) as 


Mee f if PU Q= {U}, 
0 otherwise. 

It seems that the information encoded by M is excessive for keeping con- 
nected sets in dynamic programming. For example, consider rows 2, 3, and 
4 of the matrix M in Fig. [1.I]and let P, Q, and R be the partitions of U 
corresponding to these rows. Observe that in each column of M the total 
number of 1s in rows { P,Q, R}, is either zero, or at least two. Consequently, 
for every partition X € (U), if there is a partition S € {P, Q, R}, such 
that X US = {U} (ie., M[X,S] = 1), then there is also S’ € {P,Q, Ry, 
such that S # S, and X U S’ = {U}. This means that in our example, if 
we have all three partitions P,Q, R in some family c[t, X, i] in the dynamic- 
programming routine, then we can discard one of them! Unfortunately, we 
do not know how to efficiently check whether some partition is irrelevant 
with respect to the mentioned criterion, as the number of columns of M is 
|[U|2UUD and we cannot afford iterating over all of them. 

For this reason we pick a different route. Now consider the first four rows of 
the matrix and observe that their sum is a zero vector when we work in GF(2) 
(see Section [10.4] for properties of finite fields). But when a sum of vectors is 
a zero vector in GF(2), then for any column there is an even number of 1s, 
and in particular there is no column with a single 1. Our strategy is to use 
computation in GF(2) to show an upper bound on the rank of M, as when we 
have a subset of rows of M of carnality greater than rank(M), there always 
exists a subset summing to zero. 

In order to prove a clUl upper bound on the rank of M we again consider 
a class of simple partitions, namely cuts into two sets, just as we did in 
Section[11.2.1] The following lemma shows that when operating in GF(2), the 
matrix M can be decomposed into a product of two cutmatrices C, defined 
as follows. 

In Definition [11.8] we use relation C on partitions, where P C Q when P 
is coarser than Q, meaning that each block of Q is contained in a single block 
of P, or in other words that Q is a refinement of P. 
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Fig. 11.1: Each row and column corresponds to a partition of a 4-element 
universe, where the topmost vertex of a vertical graph corresponds to the 
leftmost vertex of a horizontal graph. Each entry of this matrix is equal to 
1 iff the union of the two graphs is connected, i.e., when the join of the two 
partitions is the partition with a single block 


Definition 11.8. Define 
cuts(U) := {(V1, V?) : VIUW=UAV' NV? =bA%y EV+}, 
where vı stands for an arbitrary but fixed element of U. Define a matrix 


C e {0,1} 7) xextsU) by CIQ, (V!,V2)] = [(V1, V2) E Q]. In other words, 
element (Q, (V1, V?)) of C is 1 if (V1, V?) E Q and 0 otherwise. 








Matrix C represents which pairs of partitions and cuts are consistent, i.e., 
the partition needs to be a refinement of the cut. It is crucial that one side of 
the cut is fixed to contain a particular vertex since that allows the following 
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argument via counting modulo 1: For each pair of partitions, the product CCT 
can be seen to count the number of partitions that are consistent with both P 
and Q. If P U Q = {U} then there is only one such partition, namely (U, Ø). 
Otherwise, if P U Q is a partition with at least two sets, then there is an 
even number of cuts that are consistent, as for each block of P U Q which 
does not contain vı € V', we are free to put the block either entirely to V1 
or into V?, independently of other blocks. 


Lemma 11.9. It holds that M => CCT, where => denotes equality in GF(2). 


Proof. For every P,Q € IT(U) we have that 





(CCT)[P, Q] = So (WW) E P] [VVER] 


(V1,V2)ecuts(U) 


= y (Vt, V?) E Pa (Vt, VAE Q] 


(V1,V2)€cuts(U) 


= J [VVA EPUQ] 
(V1,V2)ecuts(U) 
Z gi#blocks(P u Q)-1 


= [P U Q = {U}. 











The crux here is that we count the number of cuts which are coarser than 
both P and Q, i.e., cuts (V1, V?) € cuts(U) such that each set of P or Q is 
completely contained in V+ or V?. This is equivalent to a situation where each 
block of P U Q is contained in V! or V?. Considering the blocks of P U Q 
it is clear that the block containing vı must be in Vt, by the definition 
of cuts(U). Any further block can be either in V! or V?, which gives a total 
of Q#>locks(P U Q)—1 coarser cuts. Clearly, that number is odd if and only if 
there is a single block in P U Q (the one containing 1), i.e., if P U Q = {U}. 














Based on Lemma [11.9] we prove that instead of looking for linear depen- 
dence between rows in M we can in fact look for linear dependence in C, 
which is much more convenient as C has a single exponential (with respect 
to |U|) number of columns, while M does not. The following lemma proves 
that given a set of partitions linearly dependent in C we can discard any one 
of them. (Here, C/Q,-] denotes the row of C corresponding to the partition 


Q.) 


Lemma 11.10. Let X 
U 


Vaex Cl, ] and P 
QU R= {U}. 


I(U) and P,R € I(U) such that C[P,-] = 


C 
R {U}. Then, there exists Q E€ X such that 


Proof. We know that by Lemma 


370 11 Improving dynamic programming on tree decompositions 


MIP, ‘| =2 CIP, Je? 
{by assumption } 
= (X CQ, Jc" 
QEX 
{by linearity of matrix multiplication } 
=2 5 MIQ, J. 
QEX 


In particular, M[P, R] =2 X oex MIQ, R]. Thus, if M[P, R] = 1 then 
MIQ, R] = 1 for some Q € ¥ and the lemma follows. 














Lemma |11.10|implies that finding a basis of the row space of the matrix 
C by standard Gaussian elimination gives the following theorem. 


Theorem 11.11. There is an algorithm that given a set of partitions A C 
II (U) in time |AJO®20UU)) finds a set A’! C A of size at most 2101-1, such 
that for any P € A and R € I (U) with P U R = {U}, there exists a partition 
QEA, such that QU R = {U}. 


Proof. Let A’ be a basis of the row space of the matrix C[A,-] (in particular 
A’ is a subset of rows of C[.A,-] of maximum rank, which is why the approach 
is called rank-based). We can find A’ by standard Gaussian elimination in 
time |A|OM 2°07). As the matrix C has 2!¥!-1 columns, the rank of C[A,-], 
and hence the size of A’, is upper bounded by 2!¥!-!. 

Consider any P € A and R € H(U) with PU R = {U}. If P € A’ then 
it is enough to take Q = P. Otherwise P was not taken to the basis of 
the row space of C, which means that there is a subset Y C A’ such that 
C[P,-] =2 X oex CIQ, ]. Then, by Lemma [L110] there is Q € A’ such that 
Q U R= {U} and the lemma follows. 














Let us take a step back and recall what was our goal for this section. In 
our dynamic-programming approach for the STEINER TREE problem, for a 
fixed node X; of the given tree decomposition, a fixed subset X C X;, and a 
fixed integer i we wanted to limit the number of considered partitions of X 
in c|t, X, 7]. In order to do that for every triple (t, X, i), instead of computing 
the entire family c|t, X, i] we only compute a subset c'[t, X, i] C cft, X, i] as 
follows. 

Fix a triple (t, X,i). We want to compute a subset c’[t, X,7] C cft, X, il, 
given the values c’[t’,-,-] where t’ iterates over the children of t. In the 
straightforward dynamic-programming algorithm, similar to the one of Sec- 
tion [7.3.3] the recursive formula for c|t, X, i] depends on the values c[t’, -, -] for 
the children of t. In our algorithm, we first apply the aforementioned recur- 
sive formula for c|t, X, i], but taking instead the values c’'[t’,-,-] as an input. 
Second, we apply Theorem [11.11] to the obtained set to shrink its size to at 
most 2!*!-!, obtaining the final value c’[t, X, i]. In particular we are keeping 
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only a single exponential number of partitions, deciding which to keep in a 
greedy manner based on the Gaussian elimination. 

Note that in order to prove the correctness of the approach, one needs the 
following lemma, which can be proved by inspecting the dynamic program- 
ming recursive formulas in each node of the tree decomposition. On a high 
level the proof of the following lemma is not complicated, however formaliz- 
ing all the details is a nontrivial and rather tedious task, hence we omit the 
proof. 


Lemma 11.12. For any t, X,i, partition P € cit, X, i], and partition R of X 
such that P U R = {U}, there exists Q € c'|t, X, i] such that Q U R = {U}. 


In particular, Lemma implies that if c|t,X,i] is nonempty then 
c'[t, X, i] is also nonempty, which guarantees that when deciding what the 
final answer is (after the dynamic programming computation is done), c’ and 
c lead to the same result. 

It is not hard to extend Theorem [11.11] to a weighted setting (see Ex- 
ercise which allows us to solve the weighted variant of the STEINER 
TREE problem with single exponential dependence on treewidth. By con- 
sidering weighted partitions we can omit the accumulator i from our state 
definition, and retrieve the linear dependence on n (see Exercise [11.13). 

The rank-based approach presented above also works for other connectivity 
problems. 


Theorem 11.13. Let G be an n-vertex graph given together with its tree 
decomposition of width k. Then 


STEINER TREE, 

FEEDBACK VERTEX SET, 

CONNECTED DOMINATING SET, 
HAMILTONIAN PATH and LONGEST PATH, 
CONNECTED VERTEX COVER, and 
CONNECTED FEEDBACK VERTEX SET. 


are solvable in time 20%) . n. 


As in Section by combining Lemma with Theorem |11.13| we 
obtain the following corollary, which improves the exponent of the running 


time from Corollary by a log k factor. 


Corollary 11.14. On planar graphs 


STEINER TREE parameterized by the size of the tree, 
FEEDBACK VERTEX SET, 

CONNECTED DOMINATING SET, 

LONGEST PATH, 

CONNECTED VERTEX COVER, and 

CONNECTED FEEDBACK VERTEX SET. 
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are solvable in time 20(V®) nO, 


We would like to note that the CYCLE PACKING problem was listed in The- 
orem but does not appear in Theorem The intuitive reason for 
this is that in the CYCLE PACKING problem we want to maximize the number 
of connected components (i.e., cycles) in the solution while the approaches 
given in this section work only for problems where we are to minimize the 
number of connected components. In fact one can show that under the Expo- 
nential Time Hypothesis (see Chapter{14) it is impossible to break the k?) 
dependence for CYCLE PACKING, but this is beyond the scope of this book. 


Exercises 


Unless otherwise noted, in the exercises assume a tree decomposition of width k is given. 


11.1. Recall that in the INDUCED MATCHING problem we are asked if there is a subset of 
2£ vertices in G inducing a matching on £ edges. Show that INDUcED MATCHING is solvable 
in time 3*n©, 


11.2. In the Tota DOMINATING SET problem we want to decide whether there is a set 
X C V(G) of size at most £ such that each vertex in G (in particular each vertex in X) 
has a neighbor in X. Show that TotTaL DOMINATING SET is solvable in time Akn OQ), 


11.3. Show how to obtain 3*k°()n running time for the DOMINATING SET problem (note 
the linear dependency on n). 


11.4 (Z). Give a 2*n°()-time algorithm computing the cardinality of the set R defined 


by E1}. 


11.5. Give a 3*nOC()-time algorithm computing the cardinality of the set C defined 


by (17.3). 


11.6. Show a one-sided error Monte Carlo 3*n©)-time algorithm with false negatives 
solving the CONNECTED VERTEX COVER problem. 


11.7. Show a one-sided error Monte Carlo 4*n©)-time algorithm with false negatives 
solving the CONNECTED DOMINATING SET problem. 


11.8 (2). Show a one-sided error Monte Carlo 3°n°()-time algorithm with false negatives 
solving the FEEDBACK VERTEX SET problem. 


11.9 (È). Show a one-sided error Monte Carlo 3'n©)-time algorithm with false negatives 
solving the FEEDBACK VERTEX SET problem parameterized by the solution size (i.e., when 
k is the number of vertices we are allowed to remove). 


11.10 ($). Give a 3*n© time Monte Carlo algorithm for STEINER TREE that applies 
the Schwartz-Zippel lemma instead of the isolation lemma. 


11.11 (8). Give a 2*n°@)-time algorithm for the LoncEesT PATH problem parameterized 
by the length of the path (i.e., here k denotes the length of the path, and no tree decom- 
position is given) that uses the isolation lemma instead of the Schwartz-Zippel lemma. 
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11.12. Show that there is an algorithm that given a set of partitions A C H(U) and 
a weight function w : A> N in time |AJOM2°(1UD finds a set A’ C A of size at most 
2IUl-1, such that for any P € A and R € II(U) with P U R = {U}, there exists a partition 
Q E€ A’, such that Q U R = {U} and w(Q) < w(P). 


11.13. In the weighted variant of the STEINER TREE problem each instance is additionally 
equipped with a weight function w : E(G) > N and we are to find a minimum cost Steiner 
tree. 

Show an algorithm solving this problem in time 20(*)n (note linear dependence on n). 


Hints 


Each vertex can be in one of three possible states: 


e already matched, 
e decided to be matched, but not yet matched, 
e decided not to be matched. 


Each vertex can be in one of four possible states, depending on whether it is chosen 
to a solution or not, and whether it already has a neighbor in a solution or not. 

Consider a join node t with children tı and tg. For a fixed state at t, the interesting pairs 
of states at tı and t2 keep the same choice whether a vertex in a bag is in the solution 
or not, whereas the sets of already dominated vertices need to satisfy a certain subset 
convolution formula. 


Show a way to narrow the maximum possible difference between values c[t, f] for 
a node t over all functions f to O(k). Then argue that we can use the min-sum semiring 
with values bounded by O(k). 


In a state of dynamic programming keep the subset of vertices selected to H as well 
as the number of edges added so far to H. 


Let X be a feedback vertex set we are looking for, and assume |X| = £. The first 
step is to guess c, the number of connected components of G — X. Then, G — X hasn — £ 
vertices and n—£—c edges. Our goal is to use Cut & Count and cuts of the graph G—X to 
ensure that G— X has at most c connected components, while, in separate accumulators 
of the dynamic programming, keeping track of the number of vertices and edges of G — X. 

However, there is a significant challenge to generalize the approach from the STEINER 
TREE example, where we want the solution graph to have one connected component, to 
our case of at most c connected components. To cope with this issue, we introduce a set 
M of c markers, which will serve the role of the vertex vı in the STEINER TREE example. 
More precisely, we define C to be a family of triples (X, M,C), where: 


1. |xX|=4 

2. G—X has exactly n — £ — c edges; 

3. MC V(G)\ X and |M| = c; 

4. C = (Vt, V?) is a cut of G — X, consistent with G — X, such that M C Vt. 


It is a relatively standard task to design a dynamic-programming algorithm counting |C| 
in time 3'n©), if a decomposition of width k is provided. An argument similar to the 
one for STEINER TREE shows that, for every sets M,X C V(G) there exists exactly one 
cut C such that (X, M,C) € C if and only if: 


1. |X| =4 
2. G—X is a forest with exactly c connected components; and 
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3. |M|=c and every connected component of G — X contains exactly one vertex of M. 


Moreover, for other pairs M,X C V(G), there is an even number of cuts C such that 
(X,M,C) EC. 

To conclude, we apply the isolation lemma to the universe containing two copies of 
every vertex of V(G); one copy of a vertex v corresponds to taking v into X, and the 
second copy corresponds to taking v into M. 


Use iterative compression together with the solution to Exercise {11.8 


11.10] For every edge e € E(G) create a variable xe and for every vertex v € V(G) create 
a variable y,. With every subgraph H € R associate a monomial p(H) = Icea) Le: 
Ievan Yv. Design a dynamic-programming algorithm to compute, within the promised 
time bound, the value of the polynomial DVur,cyec p(H), given fixed values of the input 
variables. Argue, as in the case for STEINER TREE, that in a field of characteristic 2 the 
aforementioned polynomial is nonzero if and only if the input instance is a yes-instance. 
Finally, use the Schwartz-Zippel lemma to get a Monte Carlo algorithm. 


11.11] Proceed as in Section|10.4.1] Without loss of generality, assume the input graph is 
directed. Count pairs (W, £), where W is a walk with k vertices and £ : [k] — [k] is a bijec- 


tion. Randomly choose weights w(e) for every edge e € E(G) and w(v, i) for every vertex 
v € V(G) and i € [k]. To a pair (W, £) assign a weight >> cew w(e) + iE, w(i &(4)), 
where v; is the i-th vertex on the walk W, and count the parity of the number of pairs 
(W, £) of every fixed weight. 


11.12] Instead of finding an arbitrary basis of C it is enough to find a minimum weight 
basis. 


11.13] Use the modified Theorem}11.11]from Exercise|11.12| remove the weight accumu- 


lator į from the definition of a state of the dynamic programming and consider a family of 
weighted partitions for a fixed pair (t, X). 
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running times for HAMILTONIAN CYCLE. 

For CycLe Packine on planar graphs a single exponential (in treewidth) algorithm is 
given by Baste and Sau [25]. 

The isolation lemma in the form presented in this chapter comes from Mulmuley, Vazi- 
rani and Vazirani [370], but the first discovery of the isolation lemma is credited to Valiant 
and Vazirani [424]. 

The currently fastest randomized algorithms (in terms of the dependence on treewidth) 
for STEINER TREE and FEEDBACK VERTEX SET run in time 3*n°@) [178], and the con- 
stant 3 cannot be improved under the Strong Exponential Time Hypothesis. For HAMIL- 
TONIAN CYCLE parameterized by pathwidth the currently fastest algorithm runs in time 
(2+2) nO ® (surprisingly, this bound is also tight under SETH), while for treewidth 
the currently best running time is 4*nO@ [IIB]. 


When restricted to deterministic algorithms parameterized by treewidth, the best run- 
ning time for STEINER TREE and FEEDBACK VERTEX SET is (1+ 2%7!-3)*nO@ [Oi]. 


Chapter 12 
Matroids 


Tools and techniques from matroid theory have proven 
very useful in the design of parameterized algorithms 
and kernels. In this chapter, we showcase how a 
polynomial-time algorithm for MaTROID PARITY can 
be used as a subroutine in an FPT algorithm for 
FEEDBACK VERTEX SET. We then define the notion of 
representative sets, and use it to give polynomial ker- 
nels for d-HiTT1ING SET and d-SET PACKING, and fast 
(deterministic) FPT algorithms for LONGEST PATH 
and Lona DIRECTED CYCLE. 





Matroid theory unifies and generalizes common phenomena in two seem- 
ingly unrelated areas: graph theory and linear algebra. For example, in a 
certain technical sense, the notion of linear independence in a vector space 
and acyclicity in graphs behave very similarly. As we shall see, the connec- 
tion between the two areas can be exploited algorithmically by translating 
graph-theoretic problems into the language of linear algebra and then we can 
use efficient algebraic methods to solve them. 

Many of the problems considered in this book are problems on graphs, or 
on set systems, where input is a finite universe U and a family F of sets over 
U. A family F of sets over a finite universe U is a matroid if it satisfies the 
following three matroid axioms: 


e Jef, 

e if A€ Fand BC Athen BEF, 

e if A€ Fand B € F and |A| < |B| then there is an element b € B\ A 
such that (AU {b}) E€ F. 


The second property is called the hereditary property of matroids, while 
the third is called the exchange property. 
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Just like graphs, matroids have been studied extensively, and there are 
many connections between matroid theory and graph theory. A thorough 
introduction to matroid theory is beyond the scope of this book, the aim 
of this chapter is to showcase how some tools and techniques from matroid 
theory can be used to design parameterized algorithms. 

The first example is about the FEEDBACK VERTEX SET problem. Here, 
we are given a graph G and integer k, and asked whether there exists a set S 
on at most k vertices such that G—S is acyclic. The problem is NP-complete. 
However, quite surprisingly, it turns out to be polynomial-time solvable on 
graphs of maximum degree 3 by a reduction to the so-called MATROID PAR- 
ITY problem. This claim, proved in Section is the missing piece of 
the 3.619*nO\-time algorithm for FEEDBACK VERTEX SET described in 
Section 

Perhaps the most dramatic examples of the use of matroid theory in pa- 
rameterized algorithms are the randomized polynomial kernels for ODD Cy- 
CLE TRANSVERSAL and ALMOST 2-SAT by Kratsch and Wahlström [312 
[310]. The main ingredient from matroid theory in the kernelization algorithm 
for ALMOST 2-SAT is the concept of representative sets. Unfortunately the 
kernelization algorithms for ODD CYCLE TRANSVERSAL and ALMOST 2-SAT 
are quite complex, so we do not cover them in this book. Instead, we show 
how representative sets can be used to give polynomial kernels for d-HITTING 
SET and d-SET PACKING. 

It turns out that representative sets are useful not only for designing ker- 
nelization algorithms, but also for designing fast FPT algorithms. We give 
two examples of fast parameterized algorithms that use representative sets; 
the first is a 6.75°+°(%) nO) _time algorithm for the LONG DIRECTED CY- 
CLE, the second a 2.619*nO“)-time algorithm for LONGEST PATH in directed 
graphs. 

This chapter assumes that the reader has some basic knowledge of linear 
algebra. We will freely make use of such concepts as linear independence, the 
span of a set of vectors, basis vectors, dimension of a space and determinants. 
We will also use finite fields, but will restrict ourselves to the finite fields 
of prime size, that is the integers modulo a prime p. We refer you to the 
textbook for an introduction to linear algebra. 

Before we start, we define some of the basic notions from matroid theory. 
Note that, as matroid theory generalizes concepts from both graph theory 
and linear algebra, it borrows terms from both of these areas. If (U, F) is 
a matroid, then the elements of U are called the edges of the matroid, the 
sets S € F are called independent sets|}| An inclusion-maximal independent 
set S is a basis of the matroid, while the inclusion-minimal non-independent 
sets are called circuits. From the exchange property of matroids, it follows 





1 It should be emphasized that the term “independent” here alludes to linear independence 
of vectors and has nothing to do with independent sets in graphs. In particular, the family 
of independent sets of a graph is not necessarily a matroid (Exercise|12.1). 
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directly that all the bases of a matroid (U, F) have the same size; this size is 
referred to as the rank of the matroid. 


12.1 Classes of matroids 


Often the matroids that show up in applications have a particular structure. 
We will now look at some of the most common ones. The most basic class of 
matroids are the uniform matroids. A uniform matroid is completely defined 
by the universe U and its rank r, as every subset S of the universe of size 
at most r is independent. It is easy to verify that uniform matroids indeed 
satisfy the matroid axioms. 


12.1.1 Linear matroids and matroid representation 


One of the main purposes of matroids is to serve as a “combinatorial” gen- 
eralization of the notion of linear independence of vectors. Linear matroids 
are the matroids that can be defined using linear independence. Let U be a 
set, and assign to every e € U a vector ve over some field. The vectors for 
the different elements of the universe should all be over the same field F and 
have the same dimension. We now define a family F of independent sets as 
follows. A set S C U is in F if and only if the set {ve : e € S} forms a 
linearly independent set of vectors. 


Lemma 12.1. (U,F) is a matroid. 


Proof. We need to verify that (U, F) satisfies the matroid axioms. Clearly, 
Ø € F. Furthermore, a subset of linearly independent vectors is also linearly 
independent, therefore (U, F) satisfies the hereditary property. It remains to 
prove that (U, F) satisfies the exchange property. 

Let A and B be sets in F such that |A| < |B|. Then the set {ve : e € A} 
spans a space of dimension |A| while {ve : e € B} spans a space of dimension 
|B|. It follows that there is an element b € B such that vp is not in the span 
of {ve : e € A}, and so (AU {b}) € F. 














We define a matrix M with one column for each edge e € U such that 
the column vector of e is ve. The matrix M is said to represent the matroid 
(U, F) over F. A matroid M = (U, F) is called representable over F if there 
is a matrix over F that represents M. A matroid that is representable over 
the field GF(2), that is, the integers modulo 2, is called a binary matroid. A 
matroid is called representable or linear if it is representable over some field. 
Even though not all matroids are representable, all the matroids that we will 
be working with in this book are. 
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Row operations, i.e., multiplying a row by a nonzero scalar, or adding one 
row to another do not affect the linear independence of columns. Therefore, 
we can perform row operations on a matrix M representing a matroid M, 
and the resulting matrix will still represent the same matroid. Thus, if the 
rank of M is r, we can, by means of row operations, modify M such that 
the top r rows are linearly independent and all the remaining rows have 
zeroes everywhere. We can then simply delete the rows that are all 0, since 
they do not affect the linear independence of columns anyway. Hence, for a 
representable matroid of rank r, we can always ensure that the representation 
matrix has exactly r rows. If for some reason we are given a representation 
matrix with more rows, we can find a representation matrix with r rows in 
polynomial time by means of Gaussian elimination. 


In most situations we may assume without loss of generality that the 
representation matrix of a matroid of rank r has exactly r rows. 


12.1.2 Representation of uniform matroids 


We will now show that a uniform matroid with a universe U of size n and 
rank k can be represented over any field GF(p) for p > n. Let the elements 
of the universe be numbered as U = e1, €2,...,€n- Assign to each element e; 
a unique nonzero field element a;. We set the vector v; of the element e; to 
be the k-dimensional vector (1,a},a?,...,a*~1). 

Since the vectors are k-dimensional, it follows that for any set A C U of 
size more than k, the set {v; : e; E€ A} is not linearly independent. We now 
show that for any set A C U of size k, the set {v; : e; E€ A} is linearly 
independent. It suffices to show that the matrix Ma with column vectors 
{v; : e; E€ A} has a nonzero determinant. The matrix M, is a Vandermonde 
matrix [270], and hence the determinant of M4 is equal to 


II Qj — Qi. 


O<i<j<n 
e:€A,ejEA 


Since this is a product of nonzero field elements, it follows that the deter- 
minant of M, is nonzero, and hence {v; : e; € A} is a set of linearly 
independent. vectors. 
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12.1.3 Graphic matroids 


The graphic matroids are matroids that arise from graphs in the following 
way. The graphic matroid Mg of an undirected graph G has universe E(G) 
and a set S C E(G) is independent if the subgraph (V(G),S) is acyclic. 
Many of the terms used with general matroids have been coined with graphic 
matroids in mind. For example, the edges of Mg are just the edges of G. 
The circuits of Mg are exactly the cycles of G. 

It is not immediately obvious that for every graph G, Mg is actually 
a matroid. It is easy to see that the empty set is independent and that 
the hereditary property always holds, but the exchange property is a little 
trickier. If A C E(G) is independent and B C E(G) is independent, and 
|A| < |B|, which element b € B can we pick so that AU {b} is independent? 
Consider a connected component C of (V (G), A). As (V(G), A) is acyclic, if B 
has more edges inside V(C) than A, then there is a cycle in B, contradicting 
that B is independent. Therefore, in each component of (V(G), A), the graph 
(V(G), B) contains at most as many edges as A does. Since |A| < |B| it 
follows that there is an edge b € B whose endpoints are in different connected 
components of (V(G), A). But then AU {b} is independent, and hence Mg 
is a matroid. 


To represent the graphic matroid of a graph G make a matrix with a 
column for each edge and a row for each vertex. The entry corresponding 
to a vertex v and an edge e is 0 if v is not incident to e, and either 1 or 
—1 if it is incident. Each column should contain exactly one 1 and one 
—1. 


Theorem 12.2. Graphic matroids are representable over any field. 


Proof. We give a constructive proof, i.e., an algorithm that given an undi- 
rected graph G computes a representation of the graphic matroid Mg. We 
start by turning G into a directed graph D by arbitrarily orienting each edge. 
The representation of Mg is a matrix M with one column ve for each edge 
e € E(G) and a row for each vertex. We set all the entries of ve to 0 except 
in the two entries that are in the rows corresponding to the endpoints u and 
v of e. If the edge e is oriented from u to v in D, we set the entry of ve 
corresponding to u to —1 and the entry corresponding to v to 1. If the edge 
is oriented from v to u, we set the entry of ve corresponding to u to 1 and 
the entry corresponding to v to —1. 

It remains to show that a set A C E(G) is acyclic if and only if the 
set of vectors {ve : e € A} is linearly independent. We first show the 
forward direction. Suppose for contradiction that A C E(G) is acyclic but 
{ve : e € A} is not linearly independent. Then there exists a set of values 
{ae : e E€ A} such that at least one of the values a, is nonzero, and 
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cca Weve = 0. Let A’ C A be the set of edges e in A such that a, is nonzero. 
Since A is acyclic, A’ is acyclic as well, and hence there exists a vertex u that 
is incident to exactly one edge é in A’. But then the entry of J ec 4 WeVe 
corresponding to u must be nonzero, as the only nonzero contribution to the 
sum comes from ae - ve. This contradicts } <4 Weve = 0. 

For the reverse direction, suppose that A C E(G) contains a cycle C. We 
will construct a set of values {œe : e € A} such that at least one of the values 
Qe is nonzero, and J} e4 MeVe = 0. This shows that the set {ve : e € A} is 
linearly dependent. Let A’ be the set of edges of the cycle C. We set ae = 0 
for every edge e ¢ C. Consider a walk around the cycle C. For each edge 
e € A’ we set the variable a, to 1 if the arc in D corresponding to e is 
oriented in the direction of the walk, and to —1 otherwise. Consider now the 
sum Jeca VeVe- Any entry corresponding to a vertex not on the cycle C is 0. 
For the entry corresponding to a vertex u on the cycle C, the sum J oe 4 WeVe 
has two nonzero terms, one for each edge incident to u on C. Let e; be the 
edge by which the walk enters u, and e2 be the edge by which the walk leaves 
u. It is easily verified that the term in the sum corresponding to e; is 1 and 
the term corresponding to ez is —1. Thus }).- 4 Qeve = 0, completing the 
proof. 














In particular, Theorem [12.2]implies that graphic matroids are binary ma- 
troids. Observe that if we are working with GF(2) then —1 = 1. Thus the 
representation matrix for the graphic matroid of G in GF(2) is just a matrix 
with a row for each vertex, a column for each edge, and the entry correspond- 
ing to a vertex u and edge e is 1 if the edge e is incident to v. This matrix is 
called the vertex-edge incidence matrix. 


12.1.4 Transversal matroids 


Transversal matroids are another class of matroids that arise from graphs. 
For a bipartite graph G = (UUB, E) with all edges between U and B, we can 
define a matroid M with universe U, and a set S C U is independent if there 
exists a matching in G such that every vertex in S is an endpoint of a matching 
edge. It is easy to see that M satisfies Ø € F and the hereditary property; 
however it is not at all obvious that M satisfies the exchange property. 

In Exercises |12.2 we will consider two different approaches for prov- 
ing that M is a matroid. One approach is to show that M satisfies the 
exchange property using a flow argument. Another approach is to give a 
representation for M. 


Theorem 12.3. There is a polynomial-time randomized algorithm that, given 
as input a bipartite graph G = (U U B, E) and a positive integer x, outputs a 
matrix A of integers between 0 and x-n-2". With success probability at least 
1 — 1/x, the matriz A is a representation of the transversal matroid M of 
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G over the rationals. If the algorithm fails, then A represents a matroid M' 
with universe U such that every independent set of M’ is independent in M. 


Theorem [12.3] proves that transversal matroids are in fact matroids: given 
any bipartite graph G = (UUB, E) the algorithm will output a representation 
of M with positive probability. Note that the integers in the representation 
matrix A can be exponential in n; however they still only require O(n) bits 
to be stored. Since the basic operations on integers, such as addition and 
multiplication, can be done in time polynomial in the size of the bit repre- 
sentation, the overhead caused from working with the integers in A is only 
polynomial in n. Even though the proof of Theorem [12.3] is quite easy (see 
Exercise[12.3), it remains a challenging open problem to design a determinis- 
tic polynomial-time algorithm that, given as input G = (U U B, E), outputs 
a representation of the corresponding transversal matroid M. 


12.1.5 Direct sum and partition matroids 


The direct sum of two matroids is an operation that takes two matroids Mı 
and Mpg over disjoint universes U and U2 respectively, and produces a new 
matroid M over U1 UU. A subset S of U; UU» is independent in M if SN U1 
is independent in M, and S N Up is independent in Mo. It is easy to verify 
that M satisfies the matroid axioms. 

If Mı and Mp can be represented over the same field GF(p), then their 
direct sum M can also be represented over GF (p). In particular if the matrix 
A, represents M, and A» represents Mz, then the matrix 


A, 0 
0 Ap 
represents M. 


Partition matroids are the matroids that can be written as disjoint sums 
of uniform matroids. That is, in the universe U is partitioned into classes 
U;,...,Ux with an integer a; associated to each class U;, and a set S C U is 
independent in the partition matroid if |S N U;| < a; for every i € [k]. The 
representation of uniform matroids described in Section [12.1.2] together with 
the above paragraph, gives a way to represent any partition matroid with 
edge set U over any field of size at least max;<,(|U;| + 1). 


12.2 Algorithms for matroid problems 


What are the most common computational problems about matroids and how 
efficiently can we solve these problems? One kind of question we have already 
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seen a few examples of is, given as input a graph or a different structure that 
implicitly defines a matroid M, compute a representation of M. Sometimes 
we need to compute the representation of M in a particular field GF(p) or 
determine that such a representation does not exist. We will not be dwelling 
on these kinds of questions beyond what has already been discussed in the 
previous sections. 

Another type of problem is when a matroid M is given as input and 
we have to answer some question about it. When discussing computational 
problems on matroids, we have to define first how the matroid is described in 
the input. For example, we may assume that the matroid is given by listing all 
independent sets /bases, given implicitly (e.g., as the transversal matroid of a 
bipartite graph), or we can consider a setting where the matroid is given by 
an oracle capable of answering whether a set is independent. In this chapter, 
we consider computational problems exclusively on linear matroids and we 
assume that the representation matrix A of the matroid M is given as input. 

Possibly the most fundamental question is where we are given a set S and 
asked whether the set is independent in the matroid M. This can easily be 
done in polynomial time using Gaussian elimination. Using Gaussian elimi- 
nation we can also find in polynomial time a column basis for the matrix A. 
This corresponds to a maximum size independent set (or basis) in the matroid 
M and therefore we can determine the rank of M. Sometimes we want to 
find a largest possible, or a smallest possible circuit. Sometimes we are given 
a weight function w : U — Z on the universe, and the task is to find the 
basis (or circuit) S that maximizes (or minimizes) the total weight of S. We 
will denote the weight of an element u € U by w(u) and the weight of a set 
S C U by w(S) = „es w(u). The easiest of these optimization problems is 
to find a basis of minimum weight. Here, a simple greedy algorithm does the 
trick. 


To find a basis of minimum weight, start with S = Ø and repeatedly 
extend S by the smallest weight element whose addition to S keeps S 
independent. 


Formally, we start with Sg = Ø. Then, for each i > 1, let S; = Sj_1 U 
{u} where u ¢ S is the minimum weight element such that Si—ı U {u} is 
independent. Let r be the rank of M. Since all maximal independent sets of 
M have size r, it follows that the algorithm will be able to find an independent 
set S,, but unable to find any u such that S} U {u} is independent. The 
algorithm outputs the set S,. We claim that this set is actually the basis of 
minimum weight. 


Theorem 12.4. The greedy algorithm computes a basis S, of minimum 
weight. 
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Proof. Let X be a basis of M of minimum weight. Order the elements of 
X into £1,%2,...,2, by nondecreasing weight. For each i < r, define X; = 
{x1,...,2;}. We prove by induction that the weight of S; is at most the 
weight of X;. For the base case, the weight of both So and Xo is 0. For the 
inductive step, we prove that the weight of S; is at most that of X;, assuming 
that the statement holds for all j < i. 

Let u be the element the algorithm added to S;_; to obtain S;. By the 
exchange property, there is an element v € X; such that S;—1ı U {v} is inde- 
pendent. Since the algorithm picked u over v, it follows that the weight of u 
is at most the weight of v. Furthermore, since the elements x; are ordered by 
nondecreasing weight and v € X;, we have w(v) < w(a;). Thus 


(in the second inequality, we used the induction hypothesis). This concludes 
the proof. 














We leave it as an exercise (see Exercise {12.4} to prove that an analogous 
greedy algorithm can be used to find a basis of maximum weight. While the 
optimization questions about bases are computationally tractable, the ones 
about circuits are not. For example, circuits in the graphic matroid Mg of 
a graph G are exactly the cycles of G. Thus, determining whether Mg has 
a circuit of length at least n is equivalent to the NP-complete HAMILTONIAN 
CYCLE problem [224]. Since graphic matroids are binary matroids, it follows 
that finding the largest circuit is NP-complete for binary matroids. In graphic 
matroids finding the smallest circuit corresponds to finding the shortest cycle 
in a graph G. This can easily be done in polynomial time. On the other 
hand, finding the smallest circuit of a binary matroid can easily be seen to 
be equivalent to the following NP-complete problem, called EvEN SET. Here, 
input is a family F of sets over a universe U and an integer k. The task 
is to find a nonempty subset S of U of size at most k such that for every 
set X € F, |X S| is even. To see that EVEN SET is equivalent to finding 
the smallest circuit of a binary matroid, consider the set-element incidence 
matrix A of U and F. Here, every column corresponds to an element u € U, 
every row to a set X € F and the entry in the column of u and row of X is 
1 if u € X. The matrix A represents a matroid where the circuits are exactly 
the minimal subsets S$ of U such that for every set X € F, |X S| is even. 

Since the smallest /largest circuit problems are NP-hard, it is interesting 
to investigate their parameterized complexity. Determining whether a given 
transversal matroid has a circuit of size at most k is equivalent to asking 
whether a given bipartite graph G with bipartitions U and B contains a 
set S C U of size at most k such that |N(S)| < |S]. In Exercise [13.28] the 
reader is asked to show that this problem is in fact W[1|-hard, and therefore 
unlikely to admit an FPT algorithm. For binary matroids finding a circuit 
of size at most k is equivalent to the EVEN SET problem, and determining 
whether EVEN SET admits an FPT algorithm remains a challenging open 
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problem. For the parameterized complexity of problems related to EVEN SET, 
see Chapter On the other hand, one may use matroid minor theory to 
show that the problem of finding a circuit of size at least k is fixed parameter 
tractable for all matroids representable over a fixed finite field [262]. The 
parameterized complexity of this problem on matroids representable over the 
integers remains open. 


12.2.1 Matroid intersection and matroid parity 


Since it is easy to find the minimum or maximum weight independent set in 
a matroid in polynomial time, it is natural to ask whether polynomial-time 
algorithms could be achieved for generalizations of this problem. A possible 
generalization is to have multiple matroids over the same universe, and look 
for a minimum (or maximum) weight set which is simultaneously independent 
in all of the matroids. 

For a fixed integer @, we define the  MATROID INTERSECTION problem as 
follows. Input consists of a universe U and £ matrices representing matroids 
Mı, Mo,...Me over U, and an integer k. The task is to determine whether 
there exists a set S C U of size at least k that is independent in all of the 
matroids M;, i < £. 

For two set systems Sı = (U, F1) and S2 = (U, F2), we define their inter- 
section as 


S1 N S2 = (U, Fun Fp). 


Consider now the intersection Mı N Mg of two matroids Mı = (U, 7) and 
My = (U, F2). We have that Ø € Fı N Fz and that S € Fı N Fə implies 
that all subsets of S5 also are in Fı N Fo. Thus, if Mı N Mə would also 
satisfy the exchange axiom, then Mı NMə would be a matroid and we could 
find a maximum weight set which is simultaneously independent in both by 
using the greedy algorithm of Theorem In fact, then we could use the 
greedy algorithm to solve (-MATROID INTERSECTION for any £. However the 
intersection of two matroids is not necessarily a matroid, as the reader is 
asked to prove in Exercise On the other hand, the intersection of a 
matroid with a uniform matroid is always a matroid; this operation is called 
the truncation of the matroid. In particular, in Exercise the reader is 
asked to prove the following lemma. 


Lemma 12.5. If M = (U,F) is a matroid of rank at least r and U, is the 
uniform matroid with edge set U and rank r, then MOU, is a matroid of 
rank r. Furthermore there is a polynomial-time randomized algorithm. that, 
given integers x > 2 and r, and a representation of M over GF(p) with 
p>a«-r-|U|", outputs an r x |U| matriz A. With probability at least 1—1/x 
the matriz A is a representation of MOU over GF (p). Otherwise A represents 
a matroid M’ such that all independent sets in M’ are independent in MAU. 
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Thus we are still left with the question of the computational complexity 
of (-MATROID INTERSECTION. The answer turns out to be rather surprising; 
the problem is polynomial time solvable for @ = 2, but NP-complete for larger 
values of £. We start with showing the hardness proof. 


Theorem 12.6. 3-MATROID INTERSECTION is NP-complete. 


Proof. Membership in NP is trivial, and so we prove hardness. We show 
hardness by reduction from the HAMILTONIAN PATH problem in bipartite 
graphs. Here, input is a bipartite graph G with bipartitions Vz and Vr. The 
task is to determine whether there exists a path that visits every vertex of G 
exactly once. It is well known that HAMILTONIAN PATH remains NP-complete 
even when the input graph is required to be bipartite [224]. 

We make three matroids, Mz, Mp and Me, all with the edge set E(G) as 
universe. Mz is a partition matroid where a subset A C E(G) is independent 
if every vertex in Vz is incident to at most two edges in A. Similarly, M p is a 
partition matroid where a subset A C E(G) is independent if every vertex in 
Ve is incident to at most two edges in A. Finally Mg is the graphic matroid 
of G. 

We claim that an edge set A C E(G) of size at least n — 1 is independent 
in Mz, Mp and Mg if and only if A is the edge set of a path in G that 
visits every vertex exactly once. Indeed, since A is independent in Mg, the 
set A is acyclic. Since A has size at least n — 1 it follows that (V(G), A) is a 
spanning tree T of G. Since A is independent in both Mz, Mp it follows that 
the maximum degree of T is 2. However a spanning tree of maximum degree 
2 is a path visiting each vertex exactly once. This concludes the proof. 














A polynomial-time algorithm for the 2-MATROID INTERSECTION problem 
follows from an algorithm for a slightly more general problem called MATROID 
PARITY. Here, we are given as input a matrix A representing a matroid 
M = (U,F), a collection P C (5) of pairs of elements of U, and an integer 
k. The objective is to find a subset S C P of size exactly k such that the set 
Uxes X has size 2k and is independent in M. Note that the size constraint 
is equivalent to demanding that all pairs in S be pairwise disjoint. Thus it is 
sufficient to find a subset S C P of size at least k such that all pairs in S are 
pairwise disjoint and the set [yes X is independent in M. A polynomial- 
time algorithm for MATROID PARITY is out of scope of this book; we will 
simply use the existence of such an algorithm as a black box. 


Theorem 12.7. MATROID PARITY admits a polynomial-time algorithm. 


In Exercise the reader is asked to show how Theorem [12.7] implies a 
polynomial-time algorithm for 2- MATROID INTERSECTION. 

It is worth noting that MATROID PARITY is a more complex problem 
than 2-MATROID INTERSECTION. While a polynomial time deterministic al- 
gorithm for 2-MATROID INTERSECTION can be found in textbooks and 
is often covered in courses on combinatorial optimization, the determinis- 
tic algorithms for MATROID PARITY are quite complex. A possible reason 
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is that 2- MATROID INTERSECTION generalizes BIPARTITE MATCHING, while 
MATROID PARITY can be seen as a generalization of MAXIMUM MATCHING. 

In the MAXIMUM MATCHING problem we are given as input a graph G and 
asked to find a largest possible matching in G, i.e. a set S C E(G) such that 
no two edges in S share a common endpoint. In the BIPARTITE MATCHING 
problem the input graph G is required to be bipartite. BIPARTITE MATCHING 
can easily be solved in polynomial time by a reduction to MAXIMUM FLOW 
(if you have not yet seen this reduction, try making one yourself). MAXIMUM 
MATCHING is also known to be solvable in polynomial time [412], but the 
algorithms for this problem are more advanced. 

BIPARTITE MATCHING reduces to 2-MATROID INTERSECTION in the fol- 
lowing way. Let G be the input bipartite graph with bipartitions A and B. 
We make two matroids, M4 and Mpg, both with edge set E(G). An edge 
subset S of E(G) is independent in M 4 if no vertex of A is incident to more 
than one edge of S. Similarly, an edge subset S of E(G) is independent in 
M p if no vertex of B is incident to more than one edge of S. Note that both 
Ma and Mp are, by construction, partition matroids. Further, a set S' of 
edges is simultaneously independent in M 4 and in Mp if and only if Sisa 
matching in G. 

We now reduce MAXIMUM MATCHING to MATROID PARITY. Given a 
graph G we make a matroid M whose universe is V(G), and every vertex 
subset is independent. From the edge set E(G) we make the collection P of 
pairs in the MATROID PARITY instance in the most natural way; each edge 
uv corresponds to the pair {u,v} € P. Now, a subset S of edges is a matching 
if and only if the corresponding subset S C P satisfies that all pairs in S are 
pairwise disjoint, and that |J xes X is independent in M. Here, the second 
part of the condition on S always holds. 


2-MATROID INTERSECTION generalizes BIPARTITE MATCHING, while 
MATROID PARITY generalizes MAXIMUM MATCHING and 2-MATROID 
INTERSECTION. MATROID PARITY is solvable in polynomial time, and 
hence the other three problems are as well. 


Another important difference between 2-MATROID INTERSECTION and 
MATROID PARITY is that the two problems behave very differently in the 
oracle model. In the oracle model the input matroids are not required to be 
linear, and the algorithm is not given any representation of the matroids as 
input. The only way an algorithm may receive information about the input 
matroids is by making queries on the form “is the edge set A independent?”. 
2-MATROID INTERSECTION remains polynomial time solvable in the oracle 
model [161], while MATROID PARITY requires an exponential number of 
oracle calls [332]. 
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12.2.2 FEEDBACK VERTEX SET in subcubic graphs 


Recall the FEEDBACK VERTEX SET problem, where we are given as input a 
graph G and an integer k, and the task is to find a subset S of V(G) such 
that G—S is acyclic. In Chapter [4| we gave a 3.619% n0 ®-time algorithm for 
FEEDBACK VERTEX SET, under the assumption that a variant of this problem 
is solvable in polynomial time. Specifically, in the SPECIAL DISJOINT FVS 
problem, we are given as input a graph G and a vertex set W C V(G) such 
that G — W is independent and every vertex not in W has degree at most 
3. The objective is to find a smallest possible set S C V (G) \ W such that 
G — S is a forest. In Chapter [4] we stated without proof a lemma claiming 
that SPECIAL DISJOINT FVS can be solved in polynomial time. We now 
substantiate this claim. 


Lemma (rephrased). SPECIAL DISJOINT FVS has a polynomial-time 
algorithm. 

We prove the lemma here under the assumption that the input graph G is 
simple. Observe that when performing the reductions in Chapter [f] one may 
create double edges, making the considered graph a multigraph. However, 
when Lemma [4.T0]is invoked, any double edge of G would have one endpoint 
in W and one outside. Then the endpoint outside of W would have been 
deleted by Reduction Rule Thus it is sufficient to prove the lemma 
for simple graphs. 

We start by applying some (but not all) of the reduction rules described 
in Section in order to get a slightly more structured instance. We will 
use the following reduction rules from Section 


e Reduction |FVS*.1} Delete all the vertices of degree at most 1 of G. 


e Reduction 2| If there exists a vertex v in V(G)\W such that G[W U 
{v}] contains a cycle, then include v in the solution and solve the problem 
on G—v. 


We introduce a few more reduction rules. 


Reduction FVS*.5. If G[W] contains a cycle, then return no. 


Reduction FVS*.6. Suppose there is an edge uv with both endpoints in 
W. Obtain G” from G by contracting the edge uv. In graph G’, we put the 
vertex resulting from the contraction into the set W. Solve the problem on 
G. 


Safeness of Rule[FVS*.5ļ|is trivial, so we only discuss Rule[FVS*.6| We only 
apply Rule |FVS*.6ļif Rules|FVS*.2}and|FVS*.5|cannot be applied. Thus the 


contraction of uv does not create any self-loops or double edges, keeping the 
graph simple. Next we prove that Rule |FVS*.6]is safe. 


Lemma 12.8. Rule|F'VS*.6|is safe. 
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Proof. We claim that for any set S C V(G) \ W, graph G — S is acyclic if 
and only if G’ — S is. If G— S is acyclic, then observe that G’ — S is a minor 
of G — S. Since a minor of a forest is a forest, G” — S is acyclic as well. Now 
assume that G — S contains a cycle C; pick a shortest such cycle. If C does 
not contain both u and v, then C is a cycle in G’ — S as well. If C contains 
both u and v, then u and v appear consecutively on C, since C is a shortest 
cycle. In this case, contracting the edge uv changes C into a new cycle C” of 
length 1 less than C. Since the contraction of wv does not make any double 
edges it follows that the length of C is at least 4, and hence C” is a simple 
cycle in G’ — S. 














If we start with an instance of SPECIAL DISJOINT FVS and exhaustively 
apply the reduction rules above, we arrive at an instance of (G,W) of the 
problem where G[W] and G — W are both independent sets, no vertex has 
degree less than 2, and all vertices of V(G)\W have degree at most 3. Next we 
get rid of the vertices in V(G)\ W of degree 2. Note that the simple argument 
of Reduction [FVS*.3]in Section [4.3] no longer works. We cannot just remove 
the vertex and make its two neighbors adjacent: as the two neighbors are in 
W and hence undeletable, this transformation could turn a yes-instance into 
a no-instance. 


Reduction FVS*.7. If G has a degree 2 vertex v ¢ W, then add v to W. 
Lemma 12.9. Rule is safe. 


Proof. It is sufficient to prove that there exists an optimal solution S$ disjoint 
from the degree 2 vertex v. Let x and y be the neighbors of v. Consider a 
minimum size feedback vertex set S C V(G)\W. If v ¢ S, then we are done. 
Suppose now v € S. Since S \ {v} is not a feedback vertex set of G, there is 
a path between x and y in G — S. Since G — S is acyclic, this path is unique, 
and since W is independent, the path contains a vertex v’ ¢ W. We claim 
that S = (SU {v’}) \ {v} is also a feedback vertex set. Indeed, any cycle in 
G — S’ must contain v and a path between x and y disjoint from v. But the 
path between x and y in G—S is unique and contains v’, contradicting that 
the cycle was disjoint from S”. 


Let us note that we only apply Rule when all the preceding 
rules may not be applied. Furthermore, note that after an application of 
Rule G[W] is no longer independent and thus some of the other 
rules will apply. It is important that we apply Rule [FVS*.7] one vertex at a 
time, rather than inserting all degree 2 vertices into W simultaneously. In- 
deed, if G was a cycle on four vertices and W contained the first and third 
vertex on the cycle, simultaneously inserting the remaining two vertices into 
W would make a no-instance, even though the original instance has a solution 
of size 1. We are now ready to prove Lemma [4.10] 


Proof (of Lemma . After an exhaustive application of Reductions|FVS*.1 
FVS*.2) |FVS*.5| |FVS*.6| and|FVS*.7| we arrive at an instance of SPECIAL 
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DISJOINT FVS where W is an independent set and every vertex not in W 
has degree exactly 3. We show how to solve such instances in polynomial time 
by giving a reduction to the MATROID PARITY problem. Let H be a clique 
with vertex set W and My be the graphic matroid of H. Let A be a matrix 
representing M p; such a matrix can easily be computed in polynomial time; 
see the proof of Theorem [12.2] 

We will make an instance of MATROID PARITY where the matroid is M p. 
We now make a collection P C ee of pairs of elements of W as follows. For 
each vertex u € V(G) \ W, let x, y and z be the neighbors of u. We add the 
pair P, = {xy, yz} to P. Note that, for two vertices u and v, the pairs P, 
and P, could be identical; we still add both pairs to P. Thus P could be a 
multiset (but of course adding more copies of the same pair to P does not 
change the solvability of the MATROID PARITY instance). If we are looking 
for a solution S of size t to SPECIAL DISJOINT FVS we set k = n—|W|-—t and 
use this value in the instance of MATROID PARITY. To conclude the proof, 
we need to show that G has a feedback vertex set S C V(G) \ W of size at 
most t if and only if there exists a subset Z C P of size at least k such that 
all pairs in Z are disjoint and the set (J yez X is independent in My. 

There is a natural bijection between subsets of V(G)\ W and subsets of P. 
For a set X C V(G)\W we define Zx ={P, : u € X}. Clearly, the function 
that takes X to Zx is a bijection. We claim that for any X C V(G) \ W, the 
graph G|W U X] is acyclic if and only if Up,-z, Pu is independent in the 
matroid M p and all pairs in Zx are disjoint. 

Suppose first the pairs in Zx are not pairwise disjoint. Then there are two 
pairs P, and P, in Zx that both contain the edge xy of H. In this case, 
u,x,v,y forms a cycle on four vertices in G[W U X]. Suppose now that the 
pairs in Zx are pairwise disjoint, but that | P,czx Pu is not independent 
in My. Then this set contains a circuit C = c,co...ce. Here, the c;’s are 
vertices in W. For each 0 < i < £, let u; be the vertex in X such that the 
pair corresponding to u; contains the edge cici+1. Since the pairs are disjoint, 
u; is uniquely defined, and adjacent to both c; and to cj41. Similarly, let ue 
be the vertex in X whose pair contains cegc,. The vertices u1,..., ug are not 
necessarily distinct. On the other hand they cannot all be equal, since C is 
a simple cycle on at least three edges, while each vertex u; corresponds to a 
pair that contains at most two edges of C. Therefore there exists an i such 
that ui A uj41. But then UiCi+1Ui+1 is a simple path on three vertices, and 





Ui+1Ci+2Ui+2Ci+3Ui+3 - - - CLULCIUILC2U2 . . . CiUi 


is a walk from u;+ı to u; in G[W U X] avoiding c;. Consequently, G[W U X] 
contains the walk u;iCi+1ui+1 and a walk back to u; avoiding c;, so it contains 
a cycle. 

For the reverse direction, suppose now that G[W U X] contains a cycle C. 
Since G[WUX] is bipartite, we have that C = c1u1c2u2 . . . ceug, where each c; 
is in W while each w; is in X. Let Zc C Zx be the set of pairs corresponding 
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to {ui,u2,---, ue}, and define Ec = Up, ezo Pu- The set Ec is a set of edges 
in H. If two of the pairs in Zo have nonempty intersections, then we are 
done, so assume that all the pairs in Zc are disjoint. Then |Ec| = 24. We 
now count the vertices in W that are incident to at least one edge in Ec. This 
is all of the vertices c,,c2,...,c¢, plus at most one more vertex for each uj, 
since the pair corresponding to u; is incident to ci, Ci+1 and one more vertex. 
Hence there are at most 2¢ vertices incident to edges in Ec. Any graph on 24 
edges and at most 2l vertices must contain a cycle, and so we conclude that 
the set Ec is not independent in Ma. 

Finally, observe that if G has a feedback vertex set S C V(G) \ W of size 
at most t, then X = V (G) \ (W US) is a set of size at least n — |W] — t = k 
such that G[W U X] is acyclic. Similarly, if X C V(G) \ S is a set of size at 
least k such that G[W U X] is acyclic, then S = V (G) \ (WU X) is a feedback 
vertex set of size at most t. This concludes the proof. 














A nice consequence of Lemma is that FEEDBACK VERTEX SET is 
polynomial time solvable on graphs of maximum degree 3. 


Theorem 12.10. FEEDBACK VERTEX SET admits a polynomial-time algo- 
rithm on graphs of maximum degree 3. 


Proof. Given as input a graph G of maximum degree 3, we make a new graph 
G’ from G by replacing every edge by a degree 2 vertex whose neighbors 
are the endpoints of the edge. We set W to be the set of newly introduced 
vertices. A subset S of V(G) is a feedback vertex set of G if and only if it is 
a feedback vertex set of G”. Thus we can run the polynomial-time algorithm 
of Lemma on the SPECIAL DISJOINT FVS instance (G', W). 














12.3 Representative sets 


Consider the following game, played by two players, Alice and Bob. Alice has 
a family A of sets; all of Alice’s sets have the same size p. Bob, on the other 
hand, has only one set B of size q. Alice does not know Bob’s set B, but she 
does know q. The game is very simple; they sit around for a while in silence, 
and then Bob reveals B to Alice. At this point Alice wins if she has a set 
A € A such that A is disjoint from B. If Alice does not have such a set, then 
Bob wins. 

Of course this is not much of a game; the outcome is predetermined by 
the family dealt to Alice and the set dealt to Bob. However, there is a catch. 
Alice is blessed with poor memory, so she really does not want to remember 
all of A. In particular she will choose a subset A’ of A to remember and 
forget the rest. When Bob’s set B is revealed she can only look in A’ for a set 
that is disjoint from B. Even though Alice has poor memory she really hates 
losing to Bob, and so she does not want to forget a set A € A if that could 
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make the difference between winning and losing the game. Thus she wants 
to remember a subset A’ C A such that for any set B of size q, if there is an 
A € A that is disjoint from B, then there is an A’ in A’ that also is disjoint 
from B. The family A’ “represents” A in the sense that A and A’ win against 
exactly the same sets B. This brings us to the definition of representative 
sets. 


Definition 12.11. Let A be a family of sets of size p. A subfamily A’ C A is 
said to q-represent A if for every set B of size q such that there is an A € A 
that is disjoint from B, there is an A’ € A’ that also is disjoint from B. If A’ 
q-represents A, we write A’ C4, A 


=rep “* 


Is there a nontrivial upper bound on the number of sets that Alice has to 
remember? And if so, does there exist an efficient algorithm that given the 
family A and q decides which sets to remember and which to forget? The 
answer to both questions turns out to be yes, as we soon shall see. 

Let’s ask a different question. Suppose A = {Aj, Ao,... Am}. Is there a 
necessary condition for Alice not to be able to forget Ai? Well, forgetting 
A, should make the difference between winning and losing for at least one 
possible set Bı. In particular there has to exist a set Bı of size q such that A, 
is disjoint from Bı, and Bı has nonempty intersection with every other set 
A; € A, i #1. Suppose now that Alice cannot forget any set A; € A. By the 
reasoning above, for each i < m there exists a set B; such that B; N A; = 0 
and B;A; ¥ 0 for all j # i. Observe that if i A j the sets B; and Bj must be 
distinct, since B;N A; = 0, while B; N A; 4 0. The next lemma from extremal 
combinatorics, called Bollobés’ lemma, upper bounds m in terms of p and q, 
thus giving an upper bound on the number of sets Alice has to remember. 


Lemma 12.12. Let A = {A), Ag... Am} be a family of sets of size p, and 
B = {B,,Bo...Bm} be a family of sets of size q, such that for every i, 
AiN B; =, and for every i,j such that i £ j, AiN B; #0. Then m < Ca 


Before we give a proof of Lemma [12.12] let us note the consequences it 
has for representative sets. By the reasoning above, if Alice has the family 
A = {A,, Ao,... Am} and cannot forget any single set A;, then there exists 
a family B = { B1, By... Bm} of sets of size q that satisfies the conditions of 
Lemma But then, the lemma implies that |A| < (Pra); If Alice has 
more than BTA) sets then there exists one set that she may safely forget. 
Formally, for every family A of sets of size p there is a subfamily A’ C4., A 
of size at most Car We are now ready to give the proof of Lemma [12.12] 
This proof is a surprising application of the probabilistic method. 


Lemma |12.12| implies that Alice never needs to remember more than 
(a sets. 
p 
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Proof (of Lemma|12.12)). Consider a random permutation o of the universe. 
For each i < m, let “event 7” be the event that all of A; appears before all of 
B; according to ø. The probability that event i occurs is exactly 


1 
P+) * 
oy 
Suppose now that all of A; appears before all of B;. Then all of A; may not 
appear before all of B; for j # i. This is because BN A; 4 0 and A;NB; # 9, 
and hence the elements in B; N A; appear before the elements in A; N Bi. 
Thus, for i Æ j, event i and event j are disjoint. In a probability distribution, 


the sum of the probabilities of all the different outcomes is 1. Thus the sum 
of the probabilities of disjoint events is at most 1, and hence 

















Multiplying both sides by Ce) proves the lemma. 





At this point we arrive at an intriguing computational problem. We are 
given as input a family A of sets of size p, and an integer q. We know from 
Lemma [12.13] that there exists a family A’ C A of size at most oe such 
that A’ qrepresents A, but how do we find such an A’? The derivation of 
A’ from A seems “nonconstructive” in nature because the family B is not 
given to us as input. It is in fact possible to compute for each A; whether 
there exists a set B; of size q disjoint from A;, but intersecting all other 
sets in A. This amounts to solving a d-HITTING SET instance with d = p. 
This gives an algorithm for computing A’ from A. This algorithm is explored 
in Exercise [12.9] However, we will soon give a faster algorithm for a more 
general variant of the problem using completely different techniques. 


12.3.1 Playing on a matroid 


We now slightly change the game played by Alice and Bob. The board is now 
a matroid M with universe U and a family F of independent sets. Alice has 
a family A of subsets of U; all of her sets have the same size p. Bob has a set 
B C U. In the previous game, all Alice had to do was to come up with a set 
which was disjoint from B. We make Alice’s task a bit harder. 


Definition 12.13. We will say that A fits with B if A is disjoint from B and 
AU B is independent. 


When Bob reveals his set B, Alice wins if she has a set A which fits B. 
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Just as before, Alice is to remember a subfamily A’ C A, and when B is 
revealed she is only allowed to look for A in A’. Just as before, she does not 
want to forget any sets in A if that could make the difference between winning 
and losing the game. This brings us to a generalized notion of representative 
sets, tuned to this new setting. 


Definition 12.14. Let M be a matroid and A be a family of sets of size p 
in M. A subfamily A’ C A is said to q-represent A if for every set B of size 
q such that there is an A € A that fits B, there is an A’ € A’ that also fits 
B. If A’ q-represents A, we write A’ C4, A 


rep “*- 


First, observe that because of the hereditary property of matroids, Alice 
never needs to remember any sets in A that are not independent in the 
matroid. Similarly she may assume that Bob’s set B is independent, since 
otherwise surely no set in A fits B. 

Compare Definition with Definition Definition makes 
Alice’s job harder, because Alice now has to remember sets that not only 
are disjoint from B, but fit B as well. At the same time Definition 
makes Alice’s job easier, because she only needs to consider those sets B 
that fit some set in A, instead of all the sets that are disjoint from some set 
in A. Observe also that if the matroid M is the uniform matroid of rank at 
least p + q then the two definitions coincide. Thus Definition [12.14] is truly a 


generalization of Definition |12.11 


If we use representative sets without defining which matroid we are 
working with, then the matroid in question is the uniform matroid of 


rank p + q, and Definition |12.14)and Definition |12.11| coincide. 


For the first game we showed that Alice only needs to remember Ge) sets. 
What about this new game? Next we will prove that even in this new setting, 
remembering ey sets is sufficient, and that we can efficiently compute 
which sets to remember and which to forget. 

In the following we will refer to a family of sets of size p by a p-family. The 
constant w is the matriz multiplication constant, that is, the infinimum of the 
set of reals c such that multiplication of two n x n matrices can be done in 
O(n°) time. It was shown by Vassilevska Williams that w < 2.373. 


Theorem 12.15. There is an algorithm that, given a matriz M over a field 
GF(s), representing a matroid M = (U, F) of rank k, a p-family A of in- 
dependent sets in M, and an integer q such that p+q = k, computes 
a q-representative family A’ Ci, A of size at most ey using at most 


OAPI)” + (2795) operations over GF (s). 
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Theorem |12.15|implies that Alice never needs to remember more than 
(Ge) sets, and that she can compute which sets to remember and which 
to forget in time polynomial in |A]. 


In the proof of Theorem [12.15] we assume that the matrix M has full row 
rank, i.e., that M has exactly k rows. This assumption is justified by the 
discussion at the end of Section [12.1.1] Before commencing with the proof of 
Theorem we review a useful formula for the determinant of a matrix. 

For a matrix M with k rows and a subset J of {1,...,k}, we define M[J] to 
be the submatrix of M containing all the rows indexed by J. We will denote 
{1,...,k}\ I by J, and use a shorthand J` I for Mier? 


Proposition 12.16 (Generalized Laplace expansion). Let M, be a (p+ 
q) x p matriz, and Mp be a (p + q) x q matrix. Then, 


det({Ma|Mp]) = (CDP XO (-1) 7 det(Mal]) det(Me[7]). 
IC[p+q],|I|=p 


By |[M4|Mp] we mean the matrix obtained from Ma and Mpg by appending 
for each i < p+ q the i-th row of Mpg to the i-th row of M4. 


The proof of the Generalized Laplace expansion can be found in most text- 
books on linear algebra (see, for example, |371]) and we omit it here. 

How does the Generalized Laplace expansion of determinants relate to 
representative sets? In fact, the relationship is quite direct. For an edge set 
A, define the matrix M4 to be the submatrix of the representation matrix M 
containing the columns of M corresponding to elements of A. We have the 
following observation. 


Observation 12.17. An edge set A of size p fits an edge set B of size q if 
and only if the determinant of [M/4|Mz] is nonzero. 


Proof. Suppose A fits B. Then the p + q columns of M corresponding to 
AU B are linearly independent, and hence det{M,4|Mp] 4 0. If ANB 40 
then there is a column that appears both in M, and in Mg. Hence the 
columns of [M,4|Mg] are not linearly independent and so det[M,|Mz] = 0. 
Furthermore, by the definition of representation matrices, if A and B are 
disjoint but AU B is not independent, then the columns of [M4|Mp] are 
linearly dependent and det[M,|Mp] = 0. 














Suppose we have in our possession an edge set, A of size p, and know that in 
the future an edge set B of size q will arrive. When B arrives, we are going to 
check whether A fits B by making use of Observation [12.17] In particular we 
will compute det(|M4|Mp]) using the Generalized Laplace Expansion. Even 
though we do not know B yet, we can perform quite a lot of this computation 
just with the knowledge of the matrix M, the set A, p and q. 
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Order the Ca) subsets I of {1,...,k} of size exactly p into J, Io,... Ip, 
where l = (P79): For an edge set A of size p, define a Cy?) dimensional 
vector v^, where the i-th entry of v4 is 


vf = det(Mal[I;)). 


For each i < £, we have that I; = {1,...,k}\ I; is a set of size q. For each 
edge set B of size q, we define a Cy = ey) dimensional vector 0#, where 
the i-th entry of v? is 

oP = det(Mp{Ij)). 


Finally, for ease of notation, we define 
a; = (-1)>*. 


The following observation follows directly from Observation |12.17| and the 
Generalized Laplace expansion of det([M,4|Mz)). 


Observation 12.18. An edge set A of size p fits an edge set B of size q if 
and only if a oso? £0. 


Knowing A, we may precompute v4. When B is revealed we compute 
uP from B, and then test whether D ojvvP is nonzero. If it is 
nonzero then A fits B, otherwise A does not fit B. 


We are now ready to prove Theorem [12.15] 

Proof (of Theorem [72. 15). Given M, A, p, and q, the algorithm computes 
for each A € A the vector v4. This takes time O(|A|(?3")p*), since we 
need to compute |A| (235) determinants of p x p matrices, and computing the 
determinant of an n x n matrix can be done in O(n”) time. 

The algorithm arranges the vectors {v4 : A€ A}ina (7) x |A| matrix 
X, and finds a column basis Z of X. Since X has Ea) rows, it follows that 
the size of the column basis Z is at most (PTa); Finding a column basis of 
an n x m matrix can be done in time O(mn*~!) (see [48]), thus it takes 
Ale time to find the basis Z. The algorithm outputs the set 


A ={A: vf Ez} 


We remark that the given time bounds are in terms of the number of 
field operations (additions and multiplications) that we need to perform. 
When working with integers modulo a prime s, this incurs an overhead of 
O(log s log log s) in the running time. 

Since |Z| < gear it follows that |A’| < (ETN, It remains to show that 
A’ actually q-represents A. Consider any set B of size q that fits some set 
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A € A. We need to show that there is a set A’ € A’ that also fits B. By 
Observation [12.18] we have that sa ous oP £0. 

Since Z is a column basis for the matrix X, and v4 is a column of X, it 
follows that we can assign a field element Ax for each A’ € A’ such that 


We then have that 
£ £ £ 
A=B A' -B A' -B 
OF Now T = DD 5 Tiru; U = > Aa! So oy; oy. 
i=1 i=1 A'EA' AEA iml 


This means that there has to be at least one A’ € A’ such that S awe oP + 
0. But then Observation |12.18] implies that A’ fits B, completing the proof. 














Note that Theorem requires the matroid M to have rank exactly 
p+q. If M has larger rank, one can first truncate it to rank p+ q by taking 
the intersection of M with a uniform matroid of rank p + q, and compute 
representative sets in this new matroid, which has rank p+ q. To compute a 
representation matrix of the intersection of M and the uniform matroid, we 
may use Lemma [12.5} Using Lemma [12.5] has the consequence that the algo- 
rithm for computing representative sets becomes randomized. Furthermore, 
Lemma [12.5] only works if M has a representation over a sufficiently large 
field. These issues will not show up when dealing with the problems consid- 
ered in this chapter, but they do show up in more complex applications 
of representative sets. 


12.3.2 Kernel for d-HITTING SET 


In Chapter [2| we gave a kernel for the d-HITTING SET problem of size O(kt 
d! - d) based on the sunflower lemma. We now show how representative sets 
can be used to get a kernel of size Oe) - d). This is upper bounded 


by O(k? - a) = O(k“). Hence the size bound of this kernel has the same 
exponent as the sunflower lemma-based kernel, but with a better dependence 
on d in the constant factor. 

In the d-HITTING SET problem we are given as input a family A over a 
universe U, together with an integer k, and all the sets in A have size at 
most d. Using Exercise [2.27| we may alternatively focus on the Ed-HITTING 
SET problem, where every set in A is of size exactly d. The objective is to 
determine whether there is a set B C U of size at most k such that B has 
nonempty intersection with all sets in A. 
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We model the Ed-HITTING SET problem as a game between Alice and 
Bob, just as described in the beginning of Section The board is the 
uniform matroid with edge set U and rank k + d. Hence a set A of size d fits 
a set B of size k if and only if they are disjoint. 

Alice is trying to prove that the Ed-HITTING SET instance does not have 
a solution, while Bob is trying to demonstrate a solution to Alice. In the 
beginning, Alice gets the family A, while Bob secretly thinks of a tentative 
solution set B C U of size k. Then Bob reveals B to Alice, and Alice either 
finds a set A € A that fits B, proving that B is not a hitting set, or if no such 
A exists, concedes that B is indeed a solution. Just as in Section [12.3] Alice 
does not have to remember all of A; a k-representative subfamily A’ cs A 
suffices. 


Theorem 12.19. Both Ed-HITTING SET and d-HITTING SET admit kernels 


with at most (9) sets and at most a) - d elements. 


Proof. We first focus on Ed-HITTING SET. Given the instance (A, U, k), we 
compute a k-representative subfamily A’ of A of size at most (ra) using 
Theorem To invoke Theorem we need a representation matrix 
of the uniform matroid with edge set U and rank k + d; such a representation 
matrix may be computed in polynomial time (see Section [12.1.2}. Finally we 
remove from U all the elements that do not appear in any set of A’. Let U’ 
be the resulting universe; clearly |U’| < (Ta) -d. The kernelization algorithm 
outputs the instance (A’, U’, k). It remains to prove that A has a hitting set 
of size at most k if and only if A’ does. 

First, any set of size at most k that has nonempty intersection with all 
sets in A also has nonempty intersection with all sets in A’. For the reverse 
direction, suppose that there is a hitting set B of size k for A’, but that B 
is not a hitting set of A. If B is not a hitting set of A, then there is a set 
A € A that fits B. But A’ k-represents A and hence there is an A’ € A’ that 
fits B. This contradicts that B is a hitting set of A’. 

For d-HITTING SET, observe that the reduction of Exercise [2.27] does not 
change the number of sets in an instance. 














12.3.3 Kernel for d-SET PACKING 


In the d-SET PACKING problem we are given as input a family A over a 
universe U, together with an integer k. All the sets in A have size at most d. 
The objective is to determine whether there is a subfamily S C A of size k 
such that the sets in S are pairwise disjoint. In Ed-SET PACKING, every set 
of A is required to have size exactly d. 

In Chapter |2| (Exercise you are asked to make a kernel for d-SET 
PACKING using the sunflower lemma. This kernel has size O(d!d?k“). Using 
representative sets, we give a kernel for Ed-SET PACKING with EA < efki 
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sets and at most ons -d < e4dk? elements; by using Bag the same 
kernel bounds hold for the d-SET PACKING problem. Thus the kernel given 
here has the same exponent as the kernel from Chapter |2| but the kernel 
given here has better dependence on d in the constant. 


In the kernel for Ed-HirrinG SET in Section the input family 
A was a family of constraints, and we were trying to find irrelevant 
constraints that could be dropped without turning a no-instance into a 
yes-instance. For Ed-SET PACKING the family A represents possibilities, 
since A contains the sets we can select into the solution. We are now 
interested in forgetting some of these possibilities while making sure we 
do not turn a yes-instance into a no-instance. 


Theorem 12.20. Both Ed-SET PACKING and d-SET PACKING admit kernels 
with at most (EA sets and at most (E3) - d elements. 


Proof. As discussed earlier, it suffices to focus on Ed-SET PACKING. Given 
the instance (A, U, k), we compute a (k — 1)d-representative subfamily _A’ of 
A of size at most Ea using them 25 When invoking Theorem 
the matroid is the uniform matroid with edge set U and rank kd. Then we 
remove from U all elements that do not appear in any set of A’. Let U” be the 
resulting universe; clearly |U’| < ie -d. The kernelization algorithm outputs 
the instance (A’,U’,k). 

Clearly, any subfamily S C A’ of size k such that the sets in S are pairwise 
disjoint is also such a subfamily of A. It remains to prove that if there is a 
subfamily S C A of size k such that the sets in S are pairwise disjoint, then 
there is such a subfamily S’ C A’. Suppose there is a subfamily S C A of 
size k such that the sets in S are pairwise disjoint. Of all such families, pick 
the one with most sets in A’. Suppose for contradiction that S contains a set 


A € A\ Æ. Define 
B= |J s. 


SES\{A} 


Since all sets in S are pairwise disjoint, it follows that A fits B. But then 
there exists a set A’ € A’ that also fits B. Then A’ is disjoint from all sets 
in S \ {A}. Consequently, S’ = (S U {A’}) \ {A} is a subfamily of A of size 
k such that the sets in S’ are pairwise disjoint. Furthermore, S’ has more 
sets from A’ than S does, contradicting the choice of S. We conclude that 
S C A’, completing the proof. 
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12.3.4 £-MATROID INTERSECTION 


In Section [12.2.1] we showed that (-MATROID INTERSECTION is NP-complete 
for £ = 3. We now give a randomized FPT algorithm for ¢-MATROID INTER- 
SECTION, parameterized by £ and the size k of the independent set searched 
for. We first give an algorithm for an intermediate problem, MATROID Ed- 
SET PACKING. Here, we are given integers d and k, a matrix M over GF(s) 
representing a matroid M of rank kd with universe U, and a family A of 
subsets of U, where all sets in A have the same size d. The objective is to 
determine whether there is a subfamily S C A of size k such that the sets 
in S are pairwise disjoint, and such that U5-5 9 is independent in the ma- 
troid M. Note that the only difference between MATROID Ed-SET PACKING 
and Ed-SET PACKING is the last matroid independence constraint. It should 
therefore come as no surprise that the same techniques that work for Ed-SET 
PACKING also work for MATROID Ed-SET PACKING. 


Theorem 12.21. There is an algorithm for MATROID Ed-SET PACKING 
that uses |A\O + kO) operations over GF(s) and at most O((|A|O™ + 
kO(4k)) log slog log s) time. 


Proof. Using Theorem we compute a d(k — 1)-representative subfamily 
A’ C A of size at most (%7) < (ek). This requires at most |A|° operations 
over GF(s). We claim that if there is a solution S C A of size k such that all 
sets in S are pairwise disjoint and Uses S is independent M, then there is 
such a family S’ C A’ as well. The proof of this claim is almost identical to 
the proof of Theorem [12.20] and therefore omitted. 

Now we can by brute force iterate over all subfamilies S’ C A’ of size 
k, and check whether all the sets in S’ are pairwise disjoint and Uses S 


is independent in M. There are at most (5) < k°(¢*) possible choices 
for S’. For each choice of S’ we can verify whether S’ is a solution using 
(k + d)° operations over GF(s). Thus the second part of the algorithm 
uses k(4*) operations over GF(s). The operations over GF(s) dominate the 
running time. Each such operation is implemented in O(log s log log s) time, 
yielding the claimed running time bound. 














Even though the proof of Theorem[12.21]is basically identical to the proof 
of Theorem we give a kernel for Ed-SET PACKING in Theorem 
while The only gives an FPT algorithm for MATROID Ed-SET 
PACKING. The reason for the difference is that, in the proof of Theorem|[12.21| 
even after we have reduced the family A to size (ek)%, we still need to keep 
a representation of the matroid M. Of course it is sufficient to only keep the 
part of the representation of M which corresponds to elements appearing 
in some set in A. However it is possible that s is so much bigger than k 
and d, that storing a single field element requires a number of bits which is 
superpolynomial in k. This is not a problem for the FPT algorithm, but 
prevents us from stating the result as a polynomial kernel. 
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We are now ready to give an FPT algorithm for (-MATROID INTERSEC- 
TION. Here, we are given £ representation matrices M4, ... Mọ (over GF(s)) 
of matroids M,,...Me over the same universe U, and an integer k. The 
objective is to determine whether there exists a set S C U of size k which 
is simultaneously independent in all the matroids Mı, ... Me. We give an al- 
gorithm for this problem in the case that all the input matrices Mi, ... Me 
have rank exactly k. 


Lemma 12.22. There is an algorithm for --MATROID INTERSECTION in the 
case that all input matrices M1, ... Me have the same rank k. The algorithm 
uses at most |U|O\ +kO) operations over GF(s) and at most O((\U|OCM + 
kO(f*)) log slog log s) time. 


Proof. Given as input an instance of 4 MATROID INTERSECTION we construct 
an instance of MATROID Ed-SET PACKING with d = ¢. First we make £Z 
disjoint copies of the universe U, call these copies U,,...U. Then, for each 
i, let M*¥ be a copy of the matroid M;, but over the universe U; rather 
than U. We may of course use the matrix M; as a representation matrix 
of M}. Let M* be the direct sum of MT, M3,...,M7. Using the method 
described in Section [12.1.5] we can obtain a representation matrix M* of M* 
in polynomial time. M* has rank k£, and will be the matroid in the instance 
of MATROID Ed-SET PACKING. 

We now describe the family A. For each element u € U, let Su = 
{uy, U2,-..,Ue} where u; is the copy of u in U;. We set A = {Su : uE U}. 
Clearly all sets in A are pairwise disjoint. Furthermore, a subset S C U is 
simultaneously independent in M,,Mo2,...,Me if and only if U es Su is 
independent in M*. Therefore the input instance to 4 MATROID INTERSEC- 
TION is a “yes” instance if and only if there is a subfamily S C A of size k so 
that all sets in S are pairwise disjoint and U, es Su is independent in M*. 
Thus we can run the algorithm of Theorem [12.21]on the produced MATROID 
Ed-SET PACKING instance and return the same answer as this algorithm. 
Theorem [12.21] yields the claimed running time bound. 














At this point we would like to get rid of the assumption of Lemma 
that the matrices M1, ... Me have the same rank k. One can use Lemma 
to turn all the matrices into matrices of rank at most k, at the cost of mak- 
ing the algorithm randomized. Unfortunately Lemma [12.5] requires the field 
GF(s) to be big enough in order to work. If s is too small, it is possible to 
transform the representation matrices M,... Mọ into representation matrices 
over a larger field GF(s’), where s’ is sufficiently large to apply Lemma [12.5] 
and guarantee small constant error probability. However, this step requires 
using fields that are not just integers modulo a prime. We omit the details. 


Theorem 12.23. There is a randomized algorithm for —-MATROID INTER- 
SECTION. The algorithm uses at most O((\U|O + kO)) (log s)O) time, 
and produces the correct answer with probability at least 2. 
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12.3.5 LONG DIRECTED CYCLE 


We now give an example of how representative sets can be a useful tool in 
the design of graph algorithms. In particular, we will consider the LONG DI- 
RECTED CYCLE problem. Here, the input is an n-vertex and m-edge directed 
graph G and a positive integer k. The question is whether G contains a di- 
rected cycle of length at least k. In this section, we will give an algorithm for 
LONG DIRECTED CYCLE with running time 26.9*nO, 


The closely related problem where we are interested in finding a cycle 
of length exactly k can be solved in 20“)n°) time by color coding. 
However, color coding does not seem to be able to handle the LONG 
DIRECTED CYCLE problem, because here the shortest cycle out of the 
ones of length at least k could have length much bigger than k. With 
a cute trick, it is possible to solve LONG DIRECTED CYCLE in time 
20 (k log i), OU) (see Exercise|5.9), but getting the running time down to 
20(%) nO is more challenging. 


The starting observation is that G has a cycle C of length at least k if and 
only if G contains a simple path P on k vertices such that one can get back 
from the endpoint of P to the starting point without having to intersect with 
P on the way back. 


Lemma 12.24. G has a cycle C on at least k vertices if and only if there 
exist vertices u, v, a simple path P, from u to v on exactly k vertices, and a 
simple path P> from v to u such that V(P1) A V (P2) = {u,v}. 


Proof. Given C, let P; be the first k vertices of C and let u and v be the 
first and last vertices of P,, respectively. Let P> be the path back from v to 
u along the cycle C. For the reverse direction, P, U P> forms a cycle on at 
least k vertices. 


We can use Lemma [12.24] to make an n*+O(-time algorithm for LONG 
DIRECTED CYCLE. For every choice of u and v and every path P, from u to 
v on exactly k vertices we check in polynomial time whether there is a path 
P, from v to u in G — (V (P1) \ {u,v}). The algorithm runs in time n*tOM 
since there are at most n* choices for P}. 

Is it really necessary to try all possible choices for P,? As we now shall 
prove, it is in fact sufficient to restrict the choice of P; to a relatively small 
representative family. To that end, we need to define some notation. For every 
pair of distinct vertices u, v and integer p, we define 














P= {x : X CV(G),|X| = p, and there is a directed uv-path in G[X] 


visiting all vertices of X \. 
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For each distinct pair u, v of vertices, each integer p < k, and integer 
q < 2k — p, we will compute a qrepresentative family 


DPI C4 p 
Fin Cep Piw 


The matroid with respect to which we define the q-representative family Ppa 
is the uniform matroid with edge set V (G) and rank p + q. Observe that 
Theorem [12.15] implies that, for each choice of u, v, p, and q, there exists a 
q-representative subfamily P?:4 of P?, of size at most (P+) < 4¥. However, 


directly computing Pra from PR, by using Theorem |12.15| would take time 
nÊ(), since it would take this much time just to list the families P?,,. For 


now, let us not worry about how exactly the representative families P?:? will 
be computed, and instead focus on how we can use them once we have them. 


Lemma 12.25. If there exists a cycle C of length at least k, then there exist 
vertices u, v, and a path P, such that V(P,) € PE:4, q < k, and there exists 
a path P> from v to u with V(P1) O V (P2) = {u,v}. 


Proof. Let C = v1, v2,V3,...,v, be a shortest cycle out of all cycles in G of 
length at least k. Let Pj be the path vj, v2,v3,...,uUg% and Pz be the path 
Uk, Uk+1; +++, Ur, U1 (see Fig. |12.1). We now define a vertex set B as follows. 


B= {Uk+1, Uk+2, - += VOR} if r > 2k, 
{Uk+1, Uk+2,---, Ur} otherwise. 


We set u = v1, v = vp and q = |B| < k, and observe that V (P!) € PE, and 
that V (Pi) fits B, as they are disjoint. Since Pha qrepresents P*.,, it follows 
that there exists a path P, such that V(P,) € P*:4 and V(P,) fits B. 

We claim that V(P,) O V(P2) = {u,v}. If r < 2k, then this follows im- 
mediately, since V(P,) is disjoint from B and B contains all internal vertices 
of P2. Thus, suppose for contradiction that r > 2k, but that V(P,) N V(P2) 
contains a vertex different from u and v. 

Let v; be the first vertex on P> such that v; € V(P1) NV (P2) \ {u,v}, and 
let P} be the subpath of P, from v to v;. Since V(P,) fits B, it follows that 
i > 2k and hence P% is a path on at least k, but strictly less than |V(P2)|, 
vertices. Furthermore, the choice of v; implies that V (P1) NV (P3) = {v, vi}. 
Let Py be the subpath of P; from v; to v. Appending Př to Px yields a 
simple cycle C* on at least k vertices. However, the number of vertices on 
C™ is 


[VPP] + [V (P| = 2 < |V (P) + IV (P2)| = 2 = |V (C)|. 





This contradicts the choice of C. 











Our algorithm for LONG DIRECTED CYCLE will compute representative 
families PP:7 for each choice of u, v, p < k, and q < 2k — p, and then use 
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U2k+1 U2k U2k+1 U2k 





Fig. 12.1: Illustration for the proof of Lemma |12.25 


Lemma [12.25] together with Lemma [12.24]to check whether there is a cycle 
of length at least k in G. All that remains is to find an efficient algorithm to 
compute the representative families P?:4. 

In order to compute the representative families Pra we will exploit the 
fact that a simple recurrence holds for the sets of paths P?,,. This recurrence 
uses the subset convolution operation on set families; we now introduce this 
operation. Given two families A and C the convolution of A and C is a new 
family A x C defined as follows. 


AxC={AUC : AE A,CEC, and ANC =}. 


In Section [10.3] we defined the subset convolution operation for functions 
that take as input a subset of a universe and output an integer. The rela- 
tionship between subset convolution for set families defined here and subset, 
convolution of functions is very close, and explored in Exercise [12.12] _ 

The crux of the algorithm for computing the representative families P2?;4 
is that the following recurrence holds for the set (of vertex sets) of paths of 
length exactly p. In particular, for all u, v and p > 3 it holds that 


P= (J Phat *{{v}}. (12.1) 


wvEE(G) 


To see that the equation holds, observe that a path from u to w of length 
p— 1 can be extended into a path of length p from u to v if and only if wv is 
an edge and the path from u to w does not already visit v. We now describe 
how can be used to compute the representative families P?:7. 
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12.3.5.1 Combining representative families 


An important feature of is that it only uses U and * operators. In this 
section we will show that whenever a family A can be built up from simpler 
families by only using these operations, a representative family for A can be 
efficiently computed by repeatedly combining simpler representative families 
by the same operations. 

It is a quite common setting that there is a large family for which we know 
a small representative family exists, but are unable to compute it directly 
because the original family is just too big. In such a setting the tools we 
develop here often come in handy. We first turn to the union operation. 


Lemma 12.26. If A, and Az are both p-families, A‘, q-represents A, and 
5 q-represents Az, then A‘, U A) q-represents A; U A2. 


Proof. Consider a set B of size q such that there is a set A € A; U Ao that 
fits B. If A € Aj, then there is an A’ € A} that fits B. If A € Ag, then there 
is an A’ € Aj that fits B. 














Thus, if we want to compute a q-representative family of A, U Ag, it is 
sufficient to first compute a g-representative family Aj of Aj, then another 
q-representative family A of A2, and return A} U A5. At this point there is a 
snag, Theorem|12.15}tells us that there is a q-representative family of A;UA2 
of size Cou If A| and Ai were the result of applying Theorem |12.15}to A, 
and Ag respectively, their size could also be as much as (Era): But then A} U 
A would have size as much as 2(” ae How can we get rid of this annoying 
factor 2? Well, we could just compute a q-representative family of A* of 
A UA; by invoking Theorem|12.15}again. Note that invoking Theorem|12.15 
on Aj U Aj is much less costly than invoking it on A; U Ag, since A4 U A5 
is already a pretty small family of size just a little bit more than (re): All 
that remains is a sanity check that A* actually q-represents A, U Ao. 


Lemma 12.27. If A* q-represents A’ and A' q-represents A, then A* q- 
represents A. 


Proof. Consider a set B of size q such that some A € A fits B. Since A’ 
q-represents A, there is an A’ € A’ that fits B. Now, since A* q-represents 
A’, there is an A* € A* that fits B. 














We now give an analogue of Lemma [12.26] for subset. convolution. Subset 
convolution has a slightly more complicated behavior with respect to repre- 
sentative sets than union, because the sets in the family A, * A2 are bigger 
than the sets in A, and Ag. 


Lemma 12.28. Let A; be a pi-family and Az be a p2-family. Suppose A} 
(k — pi)-represents A, and AS (k — p2)-represents A2. Then Aj * AS (k — 
pı — p2)-represents A, * A2. 
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Proof. Suppose there is a set B of size (k — pı — p2) that fits a set (A; U A2) € 
A, x Ao, where Ay & Al, A» & Ao, and Ay N Ag = 0. Then |BU Ap| = k — pı 
and so there is a set A} € Aj that fits BU Ag, that is, |A] U A2 U B| = k and 
A’ U A2U B is independent. This means that Ag € Ag fits with Ai U B. Now, 
there is a set A, € Aj that fits A} U B as well, and so |A} U A4 U B| = k and 
Ai, UASUB is independent. But then A} UA% fits B, and (A1 U A$) € A) * A$. 














Consider a pı-family A; and a po-family Az with p = pı + po. In order to 
compute a g-representative family of A; * A2, we can first compute a (q +p2)- 
representative family A} of A, then a (q + p1)-representative family Aj of 
A2, and output A‘, * Aj. However A| * Aj could be quite large compared 
to (P39, and so we might want to run Theorem |12.15|0n Aj x Aj to get a 


representative family A* of size (? 5) Note that computing Aj * A) from A‘ 
and A’ takes roughly |A1|: |43| time. On the other hand, computing A* from 
A‘ x Ah takes around |.A‘|-|A$|- om time, so this step is the more costly 
one. Nevertheless, if A, and Ah both have size 20+, then Ai * AS can be 
computed from A‘, and Aj in time 20+® times a polynomial overhead. 


12.3.5.2 Long directed cycle, wrapping up 


We are now ready to return to the LONG DIRECTED CYCLE problem. All that 
was left was to compute, for every u, v, p < k, and q < k, a q-representative 
family PP:4 of P?,. The size of P?:4 will be upper bounded by (a For 
reasons that will become apparent soon, we compute a few extra represen- 
tative families, in particular, we will compute representative families for all 


q < 2k — p rather than only for q < k. To that end we will combine (12.1) 
with the tricks we developed in Section |12.3.5.1 


Lemma 12.29. There is an algorithm that, given G and k, computes, for 
every pair u, v of vertices and every integers 2 < p < k and q < 2k — p, 
a family PP: that q-represents P?,. The algorithm takes time AwkyOW) < 
26.9" n00, 


Proof. For p = 2, it is very easy to compute representative families of P2,,, 


because the families P2, themselves are so simple. Specifically, we have that 
P2, = {{u, v}} if uv € E(G) and P2, = 0 otherwise. So we just set 


2,4 _ p2 
Pal = Puy 


for every u, v and q < 2k — 2. Clearly, P24 q-represents P2, and |P2.9| <1< 
(Pra; Computing these families takes polynomial time. 

At this point the algorithm enters a loop in which p iterates over the set 
{3,4,...k} in increasing order. The loop invariant is that at the start of each 
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iteration, for every u, v, 1 < p' < p and q < 2k — p', a family PP’. of size at 
most (? aa that represents P?’, has already been computed. 


In each iteration the algorithm first computes a new family P2:7 for each 
u, v and q < 2k — p as follows 


P= A Rig es (12.2) 
wve E(G) 
Lemmas }12.26]and |12.28)immediately imply that Ppa q-represents PR,- Fur- 
thermore, 
Pra ay (12.3) 
and therefore 
Peal < (PF 4)n. 
p 


Furthermore, computing P?;4 using Equation 12.2|takes time ORE) nO), 
Now, the algorithm computes a q-representative family 


pra ca pra 


uv =rep ” wv 


by applying Theorem |12.15]on Pra, This takes time 


w—l1 w 
[Pra (’ T i nO < a ‘) you). 
Pp P 


Since q-representation is transitive (Lemma |12.27), we have that Pra also 
g-represents P?,. Furthermore 


[P24] < : ia) 
P 


and so we have managed to compute the families Ppa as desired. Thus, 
assuming that the loop invariant holds at the beginning of the iteration, it 
holds at the beginning of the next iteration as well. Hence, by the end of the 
last iteration, the algorithm has computed the families P?:7 with the desired 
properties for every pair u, v of vertices and all integers 2 < p < k and 
q < 2k — p. This concludes the proof. 


With Lemma [|12.29|at hand, it is easy to get an algorithm for LONG DI- 
RECTED CYCLE. 














Theorem 12.30. There is a 26.9% n ®© -time algorithm for LONG DIRECTED 
CYCLE 


Proof. The algorithm starts by applying Lemma |12.29| and computing, for 
every pair u, v of vertices and integers 2 < p < k, q < 2k — p, a family 
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Piv Cip Piv 


of size at most (eo). This takes 4¢*n°@) < 26.9'n°@ time. Then the 


algorithm checks, for each u, v, q < k, and set A € Pk:a, whether there exists 


uv ? 


a path P from v to u with AN V (P2) = {u,v}. This takes time 
3 ( E ") nO) < ghrp OW), 
U,V, qQck 


By Lemmas |12.24] and {12.25} such a set A and path P» exists if and only if 
G has a cycle of length at least k. 














12.4 Representative families for uniform matroids 


A common theme for the problems considered so far is that whenever we 
applied Theorem [12.15]in order to compute a representative family, we used 
a uniform matroid as the underlying matroid. Is it possible to get an improved 
variant of Theorem [12.15] that only works for uniform matroids? There are 
two kinds of improvement possible: on the size of the output representative 
family and on the time it takes to compute it. It is possible to show that the 
size bound of Theorem[12.15|cannot be improved, even for uniform matroids 
(see Exercise [12.11}. However it is possible to improve the running time. 


Theorem 12.31. Let M be a uniform matroid and A be a p-family of inde- 
pendent sets of M. There is an algorithm that, given A, a rational number 
0 <2 <1, and an integer q, computes a q-representative family A’ Chp A 
of size at most x~?(1 — x)712°P+4) in time |A|(1 — 2)~ 92°F, 


The proof of Theorem is out of the scope of the book; however 
we will see a proof of a weaker variant of Theorem [12.31] in Exercise 
To get a feel of Theorem [12.15] it is useful to consider what happens when 
= L. In this case the size of the representative family is upper bounded 
by 2?+4¢+0(P+9) and the running time is upper bounded by |A|2¢+°@+9, For 


p = q the size of the output family is essentially the same as the (2) bound 





of Theorem {12.15} while the running time is much better. However, if as 
is sufficiently far from 1/2, then setting « = 1/2 will make Theorem |12.15 


output a representative family which is significantly larger than the optimal 


ee) bound. 


In order to compare Theorems |12.15/and|12.31]it is useful to set x = are 


Since i a 
Or) Sloss) (ara) <0, )ero 
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we obtain the following corollary by setting £ = AT and applying Theo- 


+q 
rem [12.31 


Corollary 12.32. Let M be a uniform matroid and A be a p-family of in- 
dependent sets of M. There is an algorithm that given A and an integer q 
computes a q-representative family A’ Ci, A of size at most 2) 2°(P+9) in 
time |A|(545) 420 +9). 


For uniform matroids, Corollary |12.32|will compute representative families 
much faster than Theorem|12.15| at the cost that the computed representative 
family is a factor 2°?* larger than ies When p and q are of the same order 


of magnitude, the 2°?+® factor becomes negligible, but when p or q is very 
small, such as in the kernelization applications of Sections and 
one may not use Corollary in place of Theorem SEE ea the 
subexponential factor in the size bound of Corollary [12.32] would make the 
upper bound on the kernel size super-polynomial. 

The value z = F is the choice for x that minimizes the size of the 
representative family A’ computed by Theorem [12.31] The reason one might 
sometimes want to invoke Theorem [12.31] with a different value of x is that 
different values of x yield a faster running time for the computation of the 
representative family A’, at the cost of making A’ bigger. 


12.5 Faster LONG DIRECTED CYCLE 


We now have at our disposal a new hammer, Theorem The purpose 
of Theorem is that when we want to compute q-representative sets 
for uniform matroids, Theorem [12.31] does it faster than Theorem In 
Section the application of Theorem [12.15] was the main bottleneck 


for the algorithm for LONG DIRECTED CYCLE. What happens if we just 


replace all the applications of Theorem |12.15| by calls to Theorem }|12.31 
12.29 


More precisely, we will prove a more efficient version of Lemma 


Lemma 12.33. There is an algorithm that, given G and k, computes for 
every pair u, v of vertices and integers 2 < p < k and q < 2k — p, a family 


^A 


PP: that q-represents P?,,. The size of P?:4 is at most 


(2421) (P20). ine, 
D 2q 


and the algorithm takes time 6.75**+°®) nO 





Proof. Consider the algorithm in the proof of Lemma |12.29} For p = 2 com- 
puting the families P?:7 is trivial. For each choice of u, v, 3 < p < k, and q, 
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the algorithm of Lemma [12.29]first computes a family ppa and then applies 
Theorem [12.15] to compute a g-representative family Ppa of PPa, 

The only property from Theorem [12.15] for which we needed to argue 
correctness is that Ppa in fact q-represents P?:7. Thus, if we replace the call 
to Theorem[12.15]by a call to Theorem[12.3T] we still get a correct algorithm. 
All that we need to do is to select the value for x that we will use for each 
of the calls to Theorem [12.31] and analyze the running time and the size of 
the output families. The running time of the algorithm is dominated by a 
polynomial number of calls to Theorem [12.31] Hence, to bound the running 
time of the algorithm it is sufficient to bound the running time of the call to 
Theorem [12.31]that takes the longest time. 

When invoking Theorem in order to compute the q-representative 
family PRA of P27, we will use the value x, q for x, where 


ARETE ae 
Pq pt+2q 


This choice of £p, is not taken out of thin air, it is in fact the choice that 
ends up minimizing the running time of the algorithm. The choice of £p,q is 
discussed in Exercise [12.14 7 

When we apply Theorem 12.31] with T = Lp,q in order to obtain Phs Chep 


PPa, the size of Pra is bounded as follows. 
[PRA] < spg 2019), 
Here, Sp,q is defined as 
Sp,q = (pq) P (1 — £p) *. 


This proves the claimed size bound on Pra, 


From (12.3), we have that |P?:4| < sp—1,q41-n. Thus, when we apply 
12.31 P 


Theorem to compute Ppa from P?:7, this takes time 


uv? 
Sp—1gt1*(1—@pq) *- PPT a), (12.4) 
To analyze the running time further we need the following claim. 
Claim 12.34. For any p > 3 and q È 1, Sp—1,941 < €° ` P` Sp,q- 
Proof. Inserting the definition of £p, into the definition of sp, yields 
5p, =P ?(2q)4(p + 2g)? +. 


This yields the following inequality 
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Sp—-tati _ (p= 1) P+ (2g +2) 77 (p + 2q + 1)” t9 
5p,q pP (2q) (p + 2q)P*4 
Po (a) 8 


(p= LP} (2q+2)t! (p+ 2q)?+ı 


1 p—1 1 p+q 
<p: | 1+ — -1 (1+ 
<p-( =) ( =) 


<p- -e:e=@e p. 











In the last inequality, we used that (1 + 1/x)” < e for every x > 0. 4 
From (12.4) and Claim]|12.34} we have that the running time for computing 
PR:1 from P24 is bounded by 


p 2q 
spa- (1 — Ep4): 2PH) pO < (=) (2 +) _ 20(P+4) , pO). 





The total running time of the algorithm is bounded by the maximum of 


the function i 
p+2q\? (p+2q\" 
foa- (2428) (23 
p q 


times a polynomial in n and a subexponential 2°(P+4) factor. We are interested 
in the maximum of f on the domain 3 < p < k and 1 < q < 2k — p. Simple 
calculus shows that the maximum is attained for p = q = k. Hence the 
maximum of f on the domain 3 < p < k and 1 < q < 2k — p is 


k 
3k\E Lk ae 3y 
=Z] (2) =[3 [> = 6.75". 
k 2k 2 
Then the running time of the algorithm is upper bounded by 6.75*+°) nO), 
completing the proof. 

















We now have a faster algorithm to compute the representative fami- 
lies P?:4. This yields an improved bound for the running time of our algorithm 
for LONG DIRECTED CYCLE. 

We recall the 26.9%n°)-time algorithm for LONG DIRECTED CYCLE given 
in Theorem|12.30] First it computed, for every pair of vertices u,v and integers 
2<p<kand 1 < q < 2k — p, a qrepresentative family P?:4 of PP... 
These families were computed using Lemma pe end therefore have size 
at most 4*. Computing these families took 26.9*n°™) time. After this, the 
algorithm loops over every set in these families and runs a polynomial-time 
check (namely a breadth first search) for each set. This takes 4*n° time. 

The bottleneck in the algorithm of Theorem [12.30]is the call to Lemma 
If, instead of using Lemma[12.29| we use Lemma|12.33] then computing 


the representative families only takes 6.75*+°(*) nO time. However, the fam- 
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ilies may now be bigger, but their total size is not more than 6.75*+°) nO), 
since it took that much time to compute them. Thus, if we now loop over 
every set in the representative families and run a breadth first search, just as 
in the proof of Theorem this will take at most 6.75'°+°) nO time. 
Hence we arrive at the following theorem. 


Theorem 12.35. There is a 6.75*+°) nO -time algorithm for LONG Di- 
RECTED CYCLE. 


12.6 LONGEST PATH 


We can use the insights of the algorithms for LONG DIRECTED CYCLE to 
make a fast deterministic algorithm for LONGEST PATH. Here, we are given 
a directed graph G and an integer k, and the task is to determine whether G 
contains a simple path on at least k vertices. 

In Section [12.3.5] we defined for every pair of vertices u and v and integer 
p > 1 the family 


PL. = {x : X CV(G),|X| = p, and there is a directed uv-path in G[X] 
visiting all vertices of X J 


The LONGEST PATH problem can be reformulated to asking whether there 
exists a u and a v such that P*, is nonempty. Our algorithm will check 
whether PE, is nonempty by computing, for every u, v, 2 < p < k, and 
0< q< k- p, a q-representative family 

Pid Chep Pl 
Here, just as for LONG DIRECTED CYCLE, the underlying matroid is the 
uniform matroid of rank p + q. 

The crucial observation is that Pko O-represents P2,- Thus, if PP, is 
nonempty, then P*,, contains some set A that fits with the empty set 0. But 
then P*.° must also contain a set which fits with Ø, and hence P*:° must be 
nonempty as well. Thus, having computed the representative families Praa, 
all we need to do is to check whether there is a pair u, v of vertices such 
that P*:° is nonempty. All that remains is an algorithm that computes the 
representative families Pra, 


Lemma 12.36. There is an algorithm that given G and k computes for every 
pair u, v of vertices and integers 2 < p < k and q < k — p, a family PP: that 


Aa 


q-represents PR. The size of PR; is at most 


(252) (2420). pine 
D 2q 
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and the algorithm takes time ¢?*+°) nO = 2.619'nOC\. Here, o is the 
golden ratio L5. 

Proof. The proof of Lemma [12.36]is identical to the proof of Lemma [12.33] 
with the only exception being that the q-representative family P?:? of PP, is 
computed for all q < k — p, rather than for all q < 2k — p. In the running 
time analysis, this has the effect that the running time of the algorithm is 
equal to the maximum of the function 


_ (p+2q (25 


times a polynomial in n and a subexponential 2°+® factor. Here, the domain 
of fis2<p<kand0 < q < k—p (as opposed to 0 < q < 2k—p, in the proof 
of Lemma [12.33). Simple calculus shows that this function is maximized for 
p=(1- ak and q = k — p. For these values of p and q, f(p,q) = ¢?* where 


@ is the golden ratio 1/5 Thus the running time of the algorithm is upper 


bounded by ¢2*+°) nO = 2.619*nOM). 




















Armed with Lemma|12.36} all we need to do in order to solve the LONGEST 
PATH problem is to invoke Lemma [12.36|and then check whether any of the 
families P*;° is nonempty. This yields the following theorem. 


Theorem 12.37. There is a ¢?**°) nO = 2.619*nO™ -time algorithm for 
LONGEST PATH. Here, ġ is the golden ratio 15, 


This algorithm is faster than the 4*+°™)n°-time algorithm of Sec- 
tion |*5.4| which is based on divide and color. It is not quite as fast as the 
algorithms presented in Chapter but these algorithms are randomized, 
whereas the algorithm presented here is deterministic. 


Exercises 


12.1 (&). Let G be a graph and let family F contain those subsets of V (G) that are 
independent in G. Show that F is not necessarily a matroid. 


12.2. Let G be a bipartite graph with bipartition U and B. Define a family F of subsets 
of U, where S C U isin F if and only if there is a matching in G such that every vertex in 
S is an endpoint of a matching edge. Prove that M = (U, F) is a matroid. In other words, 
prove that transversal matroids are matroids. 


12.3. Prove Theorem [12.3 


12.4 (&). Give a greedy algorithm that, given a matroid M = (U, F) and a weight 
function w : U > Z, computes a basis of maximum weight. How can we find an independent 
set of maximum weight? (Note that the weights can be negative.) 
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12.5 (2). Let G be a bipartite graph. Find two matroids Mı and M2 such that an edge 
set S C E(G) is a matching in G if and only if it is independent both in Mı and Mg. 


12.6 (&). Prove that the intersection of two matroids is not necessarily a matroid. 
12.7. Prove Lemma [12.5 


12.8 (2). Give a polynomial time reduction from the 2-MATROID INTERSECTION problem 
to MATROID PARITY. 


12.9. Give an algorithm that, given a family A of sets of size p and integer q, computes a 
subfamily A’ Cfep A in time O(JA|? -p1+t00)). Here the underlying matroid is the uniform 
matroid of rank p+q. Use an algorithm for d-H1TTING SET to determine for a given A; E€ A 
whether there is some set B; of size q, disjoint from A;, but intersecting all other sets in 
A. 


12.10 (&). Use Theorem|[12.15] to prove the following generalization of Bollobas’ lemma. 
Let M be a matroid and A = {A1, Ao,..., Am} and B = {B1, Bo,..., Bm} be families of 
independent sets in M such that (a) all sets in A have size p, (b) all sets in B have size q, 
(c) Ai fits Bj if and only if i = j. Then m < (?F"), 


12.11 (Æ). Construct for each p and q a p-family A of size exactly (772) such that no 
proper subfamily A’ of A q-represents A. 


12.12 (&). Let A and B be two families of subsets of the same universe U. Prove that 
(14 * 1g)(S) # 0 if and only if S € Ax B. Here 14 is the characteristic function of the 
family A. More precisely 14 is a function that takes as input a subset S of U and returns 
1 if S € A and 0 otherwise. Recall that the convolution of two functions is defined in 
Section [10.3 


12.13. Give an algorithm that, given as input a family A of sets of size p over a universe 
of size n, computes a q-representative family A’ of A when the underlying matroid is 
the uniform matroid of rank p + q. The algorithm should have running time at most 
| A] -2Ptat+o(@+4)nO() and the size of A’ should be at most 2°+9+°(@+4), Your algorithm 
should use the construction of universal sets given in Chapter [5] 


12.14. The running time of the Lona DIRECTED CYCLE algorithm of Section|12.5]is dom- 
inated by the maximum of Equation when maximized over p and q. It is reasonable to 
assume that the choice of xp, is continuous, i.e., that 2p—1,q41 © Lp,q, and that therefore 
Sp—1,q+1 © p,q. Then Equation reduces to 

Sp,q ` (1 — p,q) 4 200? FD nO, 
For a fixed value of p and q, find the value of 0 < xp,q < 1 that minimizes the expression 
above. 


12.15. Improve Lemma |12.22}so that the algorithm for 4 MatTroip INTERSECTION only 
uses |U|OO) + 2°) operations over GF(s). 


12.16 ($). Consider the graphic matroid M of a clique on n vertices. Show that if p+q = 
n — 1, A is an edge set of size p and B is an edge set of size q, then A fits B if and only if 
AUB is a spanning tree of the clique. Use this fact, together with Theorem|[12.15] to give a 
2°(k)n OW) time deterministic algorithm for HAMILTONIAN CYCLE on graphs of treewidth 
at most k. You may assume that a tree-decomposition of G of width at most k is given as 
input. 
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Hints 


The problem is with the exchange property. Consider, for example, a path on an 
odd number of vertices, and two independent sets: one consisting of every odd vertex on 
the path, and one consisting of every even one. 


Let Sı and S2 be two independent sets in M with |S1] < |S2|. Let M; be a matching 
that witnesses that S; € F for i = 1,2. Since |M2| > |Mi|, the symmetric difference 
M,AMz2 contains a path P in G of odd length whose every odd edge belongs to Mz and 
every even edge belongs to Mı (i.e., an augmenting path for the matching M1). Observe 
that P contains exactly one vertex, say v, of S2 \ Sı and no other vertex of U \ Sj. 
Consequently, Mı AE(P) is a matching in G witnessing that S1 U {v} E€ F. 


For every e € E(G), create a variable xe. Consider a |B| x |U| matrix A where the 
entry at position (b, u), u € U, b € B, equals 0 if ub ¢ E(G), and £u» if ub € E(G). Fora 
set S C U, let Ag be the submatrix of A consisting of the columns corresponding to the 
vertices of S. The crucial observation is that S is independent in M if and only if there 
exists a set T C B of size |S| such that the determinant of submatrix As,r (consisting 
of rows and columns corresponding to vertices of T and S), treated as a multivariate 
polynomial, is nonzero. This follows from the permutation definition of the determinant: a 
nonzero monomial in det As,r corresponds to a perfect matching in G[S U T]. 

To finish the proof, it suffices to apply the Schwartz-Zippel lemma (Lemma [10.19) to 
observe that evaluating every variable xe to a random integer between 1 and x-n-2” with 
good probability does not turn any crucial nonzero polynomial into a zero. 


Let Ai and A2 be the bipartition classes of G. For i = 1,2, define M; as a direct 
sum of matroids M» for v € A;, where My, is the uniform matroid of rank 1 over the 
universe consisting of the edges incident to v. 


Let p > r be the rank of M and, by applying Gaussian elimination, assume that the 
matrix M representing M has p rows. Construct an r x p matrix B where at every cell 
(i,j) a new variable z; j is created. Consider the r x |U| matrix BM. Argue that a set S 
of columns is independent in BM (over the ring of multivariate polynomials) if and only if 
the corresponding set of elements of U is independent in MNU,. Use the Schwartz-Zippel 
lemma (Lemma [10.19) together with determinants to show that, if you randomly pick for 
every zi j a value in GF (p), then such an evaluated matrix BM represents M N Ur with 
good probability. 


Let Mı and Mə be the two input matroids over the same universe U. Make two 
copies of U, U; and U2, and modify each M; to operate over the universe U;. Consider 
the Matroip Parity instance consisting of the direct sum of Mı and Ma, and a set of 
pairs {{u1, u2} : u E U}, where u; is the copy of element u € U in the universe U;. 


The mentioned algorithm for d-HıTTING SET is the standard branching one: as long 
as there exists a set A’ € A\ {A;} that is disjoint with the currently constructed set B,, 
consider every possibility of adding an element of A’ \ A; into Bi. 


12.11] The family of all subsets of [p + q] of size exactly p does the job. 


Consider a construction U of (n, p + q)-universal sets. Observe that a set A of size 
p and a set B of size q are disjoint if and only if there is some S € U such that A C S and 
BNS =. What happens if we keep one set A € A, A C S, for every set S € U? 

To get rid of the logn factor, coming from the construction of the (n, p + q)-universal 
set, repeat the above procedure as long as it decreases the size of A by a constant factor. 
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12.15| Consider the direct sum of the matroids Mı, ... Mọ. Each subset of U of size k nat- 
urally corresponds to an independent set of size k£ in this new matroid. Use Theorem]|12.15 
on this matroid to get the algorithm. 


12.16| The straightforward dynamic-programming algorithm for HAMILTONIAN CYCLE 
may work as follows. For a bag Xz+, for every partition of X+ into Ao U Ai U Ag and every 
perfect matching M in a clique on the vertex set A1, we would like to know if there exists 
a subset F C Er such that 


1. every connected component of (V;, F) is a path with both endpoints in Ay, and V (F) = 
Vi \ Aos 

2. two vertices u,v E€ A; are the endpoints of the same path in (Vj, F) if and only if 
uve M. 


The 20(klogk)nO() time complexity comes purely from the number of choices of the 
matching M. 

Using representative sets, argue that it suffices, at every moment, for fixed bag X+ and 
sets Ao, Ai, and A2, to keep only a representative family of matchings M of size 20(*) 
for which the corresponding set F exists. To this end, observe that, if M represents part 
of the solution Hamiltonian cycle present in G+, and M’ represents the part of the cycle 
in G — Ez, then we care only about whether M U M’ is a Hamiltonian cycle in a clique on 
vertex set Ai. 


Bibliographic notes 


Matroid theory is a deep and well-studied field with a wealth of results. We refer to the 
textbook of Welsh or the book of Oxley for an introduction to the field. The 
randomized representation of transversal matroids of Theorem [12.3] can be found in the 
report of Edmonds and Fulkerson [162]. In this book we have chosen not to cover the 
deep and interesting theory of matroid minors. Highlights of matroid minor theory with 
consequences for fixed-parameter tractable algorithms include an “excluded grid theorem” 
for matroids by Geelen, Gerards, and Whittle, and an analogue of Courcelle’s theorem 
(Theorem|7.11) for matroids by Hlinény. 

The first polynomial-time algorithm for Marroip Parity is due to Lovász with 
running time O(n1”). The fastest known deterministic algorithm for MATROID PARITY is 
by Gabow and Stallmann [221], running in O(nr’) time where r is the rank of the input 
matroid and w < 2.37 is the matrix multiplication exponent (see [425]). The fastest ran- 
domized algorithm, building on work by Lovász [333], is due to Cheung, Lau, and Leung 
and has running time O(nr’—1). Even though the algorithm we give in this book for 2- 
MATROID INTERSECTION is by reduction to MATROID Parity, 2-MATROID INTERSECTION 
is considered an easier problem. A polynomial-time algorithm for 2-Matroip INTERSEC- 
TION can be found in the textbook by Schrijver [412]. The FPT algorithm for ¢-MatTro1p 
INTERSECTION of Exercise [12.15]is due to Marx [351], but using a slower algorithm for the 
computation of representative sets than the one presented in Theorem [12.15] resulting in 
a somewhat slower algorithm than what is stated in Exercise}12.15 

The first use of matroids for the FEEDBACK VERTEX SET was by Ueno, Kajitani, and 
Gotoh [423], who gave a polynomial-time algorithm for FEEDBACK VERTEX SET on cubic 
graphs. Ideas from this algorithm were later used by Cao, Chen, and Liu [70], who gave a 
3.83* nO) -time algorithm for FEEDBACK VERTEX SET. The 3.619*n0()-time algorithm 
for FEEDBACK VERTEX SET presented in this book is by Kociumaka and Pilipczuk [301]. 
Kociumaka and Pilipczuk also give the currently fastest known deterministic parame- 
terized algorithm for FEEDBACK VERTEX SET, with running time 3.592*n©). The fastest 
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known randomized algorithm for FEEDBACK VERTEX SET is by Cygan, Nederlof, Pilipczuk, 
Pilipczuk, van Rooij, and Wojtaszczyk [118], runs in 3°n°) time, and uses Cut & Count 
(see Section [11.2.1} together with iterative compression. 

The notion of representative sets can be thought of as an algorithmic variant of the 
classical Bollobas’ lemma and its generalization to linear matroids by Lovász [331 
(see Exercise [12.10}. The notion of representative sets was defined by Monien [368], where 
he gave an algorithm for Loncest PATH running in kO(K) nO) time. Monien [368 gave 
an algorithm computing a q-representative family for uniform matroids with running time 
O(|Alqp?+'), where A is the input p-family. The algorithm outputs a q-representative 
family of size O(p?). Marx gave an algorithm computing representative families for 
uniform matroids with O(|.A|?p%) running time, outputting a q-representative family of size 
(PT3), Subsequently Marx gave an algorithm computing representative families for 


linear matroids. The algorithm outputs a q-representative family of size at most (?*) in 


time oe (p+q)C|Al?. Fomin, Lokshtanov, and Saurabh proved Theorem [i2 13] 
this is the fastest currently known algorithm for computing representative sets of tinear 
matroids. In the same paper Fomin, Lokshtanov and Saurabh gave a faster algorithm 
for computing representative sets for uniform matroids, in particular they proved Corol- 
lary [12.32] Subsequently, Theorem[12.3I]was independently proved by Fomin, Lokshtanov, 
Panolan, and Saurabh and Shachnai and Zehavi [416]. This is the currently fastest 
known algorithm for computing representative sets of uniform matroids. 

Monien [368] was the first to use representative sets to design parameterized algo- 
rithms. Monien [368] used representative sets to give a kO(*)nO@)-time algorithm for 
Loncest Patu. Plehn and Voigt used representative sets and gave a kO(*)nO(®)- 
time algorithm for SUBGRAPH ISOMORPHISM where the pattern graph H has k vertices 
and treewidth t. Some more algorithms using representative sets, in particular an algo- 
rithm for L-MATROID INTERSECTION, were given by Marx [351]. With respect to kerneliza- 
tion, Kratsch and Wahlström gave a randomized polynomial kernel for ODD CYCLE 
TRANSVERSAL by encoding the problem as a problem on a certain kind of linear matroid, 
called gammoids. Kratsch and Wahlström [310] used representative sets of linear matroids 
to also give a randomized polynomial kernel for ALmMost 2-SAT, as well as for several 
other interesting problems. The kernel for d-Hittine SET presented in Section is 
due to Kratsch and Wahlström [311]. The kernel for d-SeT PackiNG in section ES 
due to Zehavi [442]. 

The Lona DIRECTED CYCLE problem was shown fixed-parameter tractable by Gabow 
and Nie [220], who gave a kO(*)n©(Q)-time algorithm for the problem. Fomin, Lokshtanov, 
and Saurabh gave the first c*nO°\) algorithm for Lona DIRECTED CYCLE by using 
representative sets. The algorithm of Fomin, Lokshtanov, and Saurabh is slightly 
slower than the one stated in Theorem because they use Corollary|12.32|to compute 
representative sets, rather than Te ea Theorem [12.35] i.e., a 6.75% nO()-time 
algorithm for Lona DIRECTED CYCLE, was proved independently by Fomin, Lokshtanov, 
Panolan, and Saurabh and Shachnai and Zehavi [416]. 
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Chapter 13 
Fixed-parameter intractability 





In this chapter, we use parameterized reductions and 
W(1]-hardness to provide evidence that certain param- 
eterized problems are unlikely to be fixed-parameter 
tractable. 





The goal of the previous chapters was to present the most fundamental 
algorithmic techniques for showing the fixed-parameter tractability of param- 
eterized problems. In this chapter, we study the complementary questions on 
lower bounds: how it is possible to show (provide evidence) that a parame- 
terized problem is not fixed-parameter tractable. 

In the earliest days of computer science, when the first algorithms were 
developed to solve combinatorial problems of practical interest such as the 
Traveling Salesperson Problem, it was observed with frustration that ev- 
ery proposed algorithm had an inconveniently large worst-case upper bound 
on the running time, typically exponential in the size of the input. Several 
decades of research have not been able to explain and prove why exponen- 
tial time is needed for these problems. The theory of NP-completeness at 
least showed that there is one common underlying reason for the lack of 
polynomial-time algorithms, and therefore conditional lower bounds based 
on NP-completeness are the best we can have at this point of time. The fact 
that computation cannot be cheap seems to be a fundamental principle and 
understanding the exact reason for this apparent hardness is a deep question 
that is well beyond our current knowledge. 

Our goal in this chapter is to develop a lower bound theory for parame- 
terized problems, similar to the NP-completeness theory of polynomial-time 
computation. Instead of taking this as an opportunity to delve deeper into the 
mysteries of efficient computation, we adopt the pragmatic viewpoint of the 
algorithm designer: our focus is on presenting evidence for as many problems 
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as possible that algorithms with certain specifications do not exist. Being 
aware of and being able to produce such negative results is essential for the 
algorithm designer: without them, countless hours can be wasted on trying 
to prove results that contradict commonly accepted assumptions. Knowing 
that certain problems are (probably) not fixed-parameter tractable prevents 
us from attacking the same problem over and over again with little hope of 
progress. Furthermore, the lower-bound theory helps the algorithm designer 
in a more subtle way as well. In many cases, it is very helpful to look at 
a problem from both the algorithmic and the complexity viewpoints at the 
same time. A failed attempt at finding an algorithm can highlight certain 
difficult situations, giving insight into the structure of hard instances, which 
can be the starting point of a hardness proof. Conversely, if one can pinpoint 
a specific reason why a proposed hardness proof cannot be made to work, 
then it may suggest an algorithmic idea that can renew the attack on the 
algorithmic side of the question. The authors of this textbook have experi- 
enced many times that an answer was found only after repeatedly jumping 
back and forth between attacks from the algorithmic side and the complexity 
side of the problem. Moreover, the hardness proofs might tell us what special 
cases or problem variants merit further exploration. Hence the lower bound 
theory helps not only in finding the answers, but can even steer the algorithm 
designer towards the right formulation of the questions. 

As we have no proof of P Æ NP, we cannot rule out the possibility that 
problems such as CLIQUE and DOMINATING SET are polynomial-time solv- 
able and hence FPT. Therefore, our lower bound theory has to be conditional: 
we are proving statements of the form “if problem A has a certain type of 
algorithm, then problem B has a certain type of algorithm as well.” If we 
have accepted as a working hypothesis that B has no such algorithms (or 
we have already proved that such an algorithm for B would contradict our 
working hypothesis), then this provides evidence that problem A does not 
have this kind of algorithm either. To prove such statements in the context 
of fixed-parameter tractability, we need a notion of reduction that trans- 
fers (negative evidence for) fixed-parameter tractability from one problem 
to the other. As we shall see, the standard notion of polynomial-time re- 
duction used in NP-completeness theory is not sufficient for our purposes. 
Section [13-1] introduces the notion of parameterized reductions, which have 
a slightly different flavor than NP-hardness proofs and therefore require a 
different toolbox of techniques. Then Section [13.2] presents a selection of ba- 
sic reductions from CLIQUE to various problems. If we accept as a working 
hypothesis that CLIQUE is not fixed-parameter tractable, then these reduc- 
tions from CLIQUE are practical evidence that certain other problems are not 
fixed-parameter tractable either. The assumption that CLIQUE is not fixed- 
parameter tractable is a stronger assumption than P Æ NP, but it seems that 
we need such a strong assumption, as we currently do not know how to base 
fixed-parameter intractability results on assuming P 4 NP only. 
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A remarkable aspect of NP-completeness is that there are literally thou- 
sands of natural hard problems that are equally hard in the sense that they are 
reducible to each other. The situation is different in the case of parameterized 
problems: there seem to be different levels of hardness and even basic prob- 
lems such as CLIQUE and DOMINATING SET seem to occupy different levels. 
Downey and Fellows introduced the W-hierarchy in an attempt to classify pa- 
rameterized problems according to their hardness. We give a brief overview of 
this hierarchy in Section The CLIQUE problem is W[1]-complete, that 
is, complete for the first level of the W-hierarchy. Therefore, CLIQUE not 
being fixed-parameter tractable is equivalent to FPT # W[1]. This is the ba- 
sic assumption of parameterized complexity; we interpret W[1]-hardness as 
evidence that a problem is not fixed-parameter tractable. In Section 
we connect the class W[1] to a fundamental computational problem about 
Turing machines. This connection provides further evidence supporting the 
assumption FPT 4 W[1]. Moreover, it allows a very convenient way of prov- 
ing membership in W/[1]. In Section we use this and other arguments 
to conclude that all the problems studied in Section [13.2] are actually com- 
plete either for W[1] or for W[2]. In Section [13.6] we finish the chapter with 
a selection of W{1]-hardness results, demonstrating some of the challenges 
one faces in typical hardness proofs and ways of overcoming these difficul- 
ties. We remark that Section [14.4-1|contains further important parameterized 
reductions. 

Following our pragmatic viewpoint, we omit here the discussion of issues 
that belong to structural complexity theory and hence are not essential to 
the algorithm designer. What complexity classes can be defined and which 
problems are complete for them? Which of these classes are equal? Are there 
intermediate problems between two classes? What notions of reductions can 
we define? Can we increase the strength of our negative evidence by basing it 
on weaker conjectures? What role does nonuniformity play in the complexity 
of the problems? These are fine theoretical questions, but are not of immedi- 
ate importance to the algorithm designer who only wants some evidence that 
the answer to a particular algorithmic question is negative. 

Finally, let us comment on the fact that all the lower bounds of this and the 
following chapters are conditional on various conjectures. The reader might be 
skeptical of the validity of these lower bounds, as they are based on unproven 
assumptions. In particular, the strongest of these conjectures, SETH, is still 
somewhat controversial and not fully accepted by the computational com- 
plexity community. Nevertheless, these lower bounds still send an important 
and useful message. The reader might dismiss these lower bounds and spend 
time on working towards an algorithmic result violating these lower bounds. 
But then the reader should be aware that he or she is actually working to- 
wards disproving one of the basic conjectures and the difficulty of proving 
the algorithmic result is not specific to the problem at hand, but it requires 
answering a basic well-studied question first. Then one could argue that per- 
haps it is more efficient to spend time on concentrating on this basic question 
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directly, rather than answering it indirectly via some more problem-specific 
algorithmic question. 


13.1 Parameterized reductions 


Let us recall the standard notion of polynomial-time reduction used in NP- 
hardness proofs. A polynomial-time many-ond!| reduction from problem A 
to problem B is a polynomial-time algorithm that, given an instance x of 
problem A, outputs an equivalent instance x’ of problem B, that is, x is a 
yes-instance of problem A if and only if 2’ is a yes-instance of problem B. 
If there is such a reduction from A to B and B is polynomial-time solvable, 
then A is also polynomial-time solvable: given an instance x of problem A, 
we can run the reduction to get an instance x’ of problem B, which we can 
solve using the assumed polynomial-time algorithm for B. 

We need an analogous notion of reduction for parameterized problems that 
transfers fixed-parameter tractability. 


Definition 13.1 (Parameterized reduction). Let A,B C X* x N be two 
parameterized problems. A parameterized reduction from A to B is an algo- 
rithm that, given an instance (z,k) of A, outputs an instance (2’,k’) of B 
such that 


1. (x, k) is a yes-instance of A if and only if (a’, k’) is a yes-instance of B, 
2. k’ < g(k) for some computable function g, and 
3. the running time is f(k) - |a|°“ for some computable function f. 


As a minor technical remark, let us observe that it can be assumed that the 
functions f and g are nondecreasing: for example, we may replace the com- 
putable function g(k) with g(k) = max*_, g(k) > g(k), which is a computable 
nondecreasing function. 

Before comparing this notion of reduction to classic polynomial-time re- 
ductions, let us formally state and prove that parameterized reductions work 
as intended. 


Theorem 13.2. If there is a parameterized reduction from A to B and B is 
FPT, then A is FPT as well. 


Proof. Let (x,k) be an instance of problem A. The parameterized reduction 
gives an equivalent instance (2’,k’) in time f(k)|a|@ with k’ < g(k) and 
\x’| < f(k)|x|* (clearly, the running time of the reduction is an upper bound 
on the size of the instance created). Suppose that B has an algorithm with 
running time h(k)n@; by the earlier discussion, we may assume that h is 





1 The terminology “many-one” refers to the fact that many instances of problem A might 
be mapped to the same instance of problem B. Some authors use the term Karp-reduction 
for such a reduction. 
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nondecreasing. Then running this algorithm on (2’, k’) determines whether 
(x’,k’) is a yes-instance of B (equivalently, whether (x, k) is a yes-instance 
of A) in time at most 


h(k’)|a"| < h(g(k))(f(k) ||)? 


(the inequality uses the fact that h is nondecreasing). Together with the time 
required for the parameterized reduction, the total running time is at most 
f'(k)|a|“@, where f’(k) = f(k) + h(g(k))f(k)@ is a computable function. 
That is, the problem B is fixed-parameter tractable. 














The second item in Definition [13.1] requires something that is not re- 
quired for classic polynomial-time reductions: the new parameter should 
be bounded by a function of the parameter of the original instance. 
Therefore, not every NP-hardness proof gives a parameterized reduc- 
tion. 


There is a very simple reduction from INDEPENDENT SET to CLIQUE: the 
size of the maximum independent set in graph G is the same as the size of the 
maximum clique in the complement G. Therefore, given an instance (G, k), 
creating an instance (G,k) is a polynomial-time reduction from INDEPEN- 
DENT SET to CLIQUE. In fact, this is a parameterized reduction: the new 
parameter is the same in the original instance, hence g(k) = k is a trivial 
bound on the new parameter. The same reduction can be used in the other 
direction as well, that is, to reduce CLIQUE to INDEPENDENT SET. Therefore, 
these two problems are equally hard in the sense that one is FPT if and only 
if the other is. 

It is a well-known fact that an n-vertex graph G has an independent set of 
size k if and only if it has a vertex cover of size n — k. Therefore, (G,k) is a 
yes-instance of INDEPENDENT SET if and only if (G,n—k) is a yes-instance of 
VERTEX COVER. This immediately gives a polynomial-time reduction from 
INDEPENDENT SET to VERTEX COVER. However, this is not a parameterized 
reduction: as n can be arbitrarily large compared to k, there is no function 
g(k) such that n — k < g(k). We do not expect that there is a parameterized 
reduction from INDEPENDENT SET to VERTEX COVER: as the latter problem 
is FPT, the existence of such a reduction would imply by Theorem [13.2] that 
INDEPENDENT SET is FPT as well, which we do not believe. 

The third item in Definition [13.1Jallows the running time of the reduction 
to be more than polynomial: it can have an additional factor depending only 
on the parameter k. This means that not every parameterized reduction is a 
polynomial-time reduction and if there is a parameterized reduction from A 
to B such that the unparameterized version of A is NP-hard, then this does 
not necessarily imply that B is NP-hard. As an artificial example, consider the 
problem CLIQUE;9,, where (G, k) is a yes-instance if k < log, |V(G)| and G 
has a clique of size k. The following simple transformation is a parameterized 
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reduction from CLIQUE to CLIQUEjog: given an instance (G, k), we output the 
instance (G’,k), where G” is obtained from G by adding 2* isolated vertices. 
Then k < log, |V(G’)| always holds, and hence (G’,k) is a yes-instance of 
CLIQUE jog if and only if G has a clique of size k. However, CLIQUE}og can be 
solved in time |V(G)|O(°82!"(@)) by brute force, as the answer is trivial if 
k > log, |V(G)|. That is, the problem can be solved in quasi-polynomial time 
nO les”) which we do not expect to be possible for NP-hard problems. 


In general, parameterized reductions and polynomial-time reductions 
are incomparable. However, the vast majority of parameterized reduc- 
tions (including most reductions appearing in this chapter) are actually 
polynomial-time reductions. 


Later in this section, a hardness proof for DOMINATING SET ON TOUR- 
NAMENTS will be a more natural example of a parameterized reduction with 
running time exponentially depending on the parameter k. 


13.2 Problems at least as hard as CLIQUE 


In this section, we present a series of parameterized reductions showing that 
CLIQUE can be reduced to various basic problems. This means that these 
problems are at least as hard as CLIQUE in the sense that if any of them is 
FPT, then it follows that CLIQUE is FPT as well. This is strong evidence 
that none of the problems considered in this section is FPT. 

To be more precise, the reductions given in this section are not all from 
CLIQUE. We use the fact, stated in the following theorem, that parameterized 
reductions compose: that is, a reduction from A to B and reduction from B to 
C implies that there is a reduction from A to C. Therefore, to show that there 
is a parameterized reduction from CLIQUE to a problem C, it is sufficient to 
reduce a problem B to C, where B is a problem to which we already have a 
reduction from CLIQUE. 


Theorem 13.3. If there are parameterized reductions from A to B and from 
B to C, then there is a parameterized reduction from A to C. 


Proof. Let Rı be a parameterized reduction from A to B (with running 
time f1(k)-|z|“ and parameter bound gi(k)) and let Rə be a parameterized 
reduction from B to C (with running time f2(k) -|x| and parameter bound 
g2(k)). As discussed after Definition we may assume that the functions 
fi, f2, 91, g2 are nondecreasing. Given an instance (z,k) of A, reduction 
R first uses Rı to create an equivalent instance (x’,k’) of B (with |a’| < 
fi(k)- |x|% and k’ < gi(k)), and then uses reduction Ra on (2’,k’) to create 
an equivalent instance of C (with k” < go(k’)). It is clear that (x, k) is 
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a yes-instance of A if and only if (x”,k”) is a yes-instance of C. We have 
k” < go(k’) < ge(gi(k)) (using the fact that go is nondecreasing), thus k” is 
bounded by a computable function of k. The running time of the reduction 
is 


Falk) + |x| + folk’) + a"? < falk): lal + falgi(k)) « (fi) ea). 
< (filk) + falga (k) fi (k)) - [alo 
= f(r) |æ”, 


where f*(k) = fi(k) + fo(gi(k)) fı (k) is a computable function (we use in the 
first inequality the fact that fo is nondecreasing). Thus reduction R satisfies 
all requirements of Definition [13-1] 














It is common practice to prove NP-hardness for the most restricted version 
of a problem, as this can be convenient when the problem becomes the source 
of a reduction later: it can be easier to devise a reduction if input instances 
of only some restricted form need to be considered. The same principle holds 
for parameterized problems and in particular for CLIQUE, which is the source 
of most parameterized reductions. We show first that CLIQUE remains hard 
even if the graph is regular, that is, every vertex has the same degree. 


Theorem 13.4. There is a parameterized reduction from CLIQUE to CLIQUE 
on regular graphs. 


Proof. Given an instance (G, k) of CLIQUE, we create a graph G” as described 
below and output the instance (G’,k). If k < 2, then the CLIQUE problem 
is trivial, hence we can output a trivial yes- or no-instance. Let d be the 
maximum degree of G. 


(i) Take d distinct copies G4, ..., Gg of G and let v; be the copy of v € V(G) 
in graph G;. 

(ii) For every vertex v € V(G), let us introduce a set V, of d—dg(v) vertices 
and add edges between every vertex of V, and every v; for 1 < i < d. 


Observe that every vertex of G’ has degree exactly d. To prove the correctness 
of the reduction, we claim that G has a k-clique if and only if G’ has. The 
left to right implication is clear: copies of G appear as subgraphs in G’, 
thus any clique in G gives a corresponding clique in G’. For the reverse 
direction, observe that the vertices introduced in step (ii) do not appear in 
any triangles. Therefore, assuming k > 3, these vertices cannot be part of 
a k-clique. Removing these vertices gives d disjoint copies of G, thus any 
k-clique appearing there implies the existence of a k-clique in G. 














As the complement of a regular graph is also regular, we get an analogous 
result for INDEPENDENT SET. 


Proposition 13.5. There is a parameterized reduction from CLIQUE to IN- 
DEPENDENT SET on regular graphs. 
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Note that we have restricted the graph to be regular, but we did not specify 
that the graph is, say, 3-regular or has any other fixed degree bound. Indeed, 
CLIQUE and INDEPENDENT SET are both FPT on r-regular graphs for every 
fixed integer r, and moreover, FPT with combined parameters k + r (see 
Exercise [3.2). This has to be contrasted with the fact that INDEPENDENT 
SET is NP-hard on 3-regular graphs. 

The following reduction shows an example where we can exploit the fact 
that the graph appearing in the CLIQUE instance is regular. The input to the 
PARTIAL VERTEX COVER problem is a graph G with two integers k and s, 
and (G, k, s) is a yes-instance if G has a set S of k vertices that covers at least 
s edges (we say that a set S covers an edge xy if at least one of x and y is in 
S). VERTEX COVER is the special case s = |E(G)|; the following reduction 
shows that the general PARTIAL VERTEX COVER problem is probably harder 
than VERTEX COVER. 


Theorem 13.6. There is a parameterized reduction from INDEPENDENT SET 
on regular graphs to PARTIAL VERTEX COVER parameterized by k. 


Proof. Let (G,k) be an instance of INDEPENDENT SET, where G is an r- 
regular graph. We claim that G has an independent set of size k if and 
only if (G,k,rk) is a yes-instance of PARTIAL VERTEX COVER. If G has an 
independent set S of size k, then every edge of G is covered by at most one 
vertex of S, hence together they cover exactly rk edges. Conversely, suppose 
that G has a set S of k vertices covering rk edges. As each vertex of G covers 
exactly r edges, this is only possible if no edge of G is covered twice by these 
k vertices, that is, if they form an independent set. 














Note that PARTIAL VERTEX COVER is fixed-parameter tractable parameter- 
ized by s, the number of edges to cover (Exercise [5.11}. 

Next we consider a version of the CLIQUE problem that is a useful starting 
point for hardness proofs. The input of MULTICOLORED CLIQUE) (also called 
PARTITIONED CLIQUE) consists of a graph G, an integer k, and a partition 
(Vi, ..-, Vk) of the vertices of G; the task is to decide if there is a k-clique 
containing exactly one vertex from each set Vj. 


Theorem 13.7. There is a parameterized reduction from CLIQUE on regular 
graphs to MULTICOLORED CLIQUE on regular graphs. 


Proof. Let (G,k) be an instance of CLIQUE, where G is an r-regular graph 
on n vertices. We construct an instance (G’,k, (Vi,...,V%)) as follows. For 
every vertex v € V(G), we introduce k vertices v1, ..., Uk into G”. The set 
V; contains v; for every v € V(G). We define the edges of G” as follows: 


(i) Every V; is a set of n independent vertices. 





2 The name “multicolored” comes from an alternate way of defining the problem: we can 
say that the vertices of G are colored by k colors and we are looking for a clique where 
every vertex has a distinct color. 
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(ii) If u and v are different and adjacent in G, then we make u; and vj 
adjacent for every 1 < i,j < k, iF j. 


Observe that the degree of every vertex in G” is (k— 1)r: vertex v; is adjacent 
to r vertices from each V; with i Æ j. 

We claim that G has a k-clique if and only if G” has a k-clique with exactly 
one vertex from each V;. Suppose that vt, ..., v? are the vertices of a k-clique 
in G (ordered arbitrarily). Then vj € Vj, ..., vf € Vp is a k-clique in G”. 
Conversely, suppose that v! € Vi, ..., vk € Vp is a k-clique in G’. It follows 
from the definition of G’ that v? and vÍ are only adjacent in G’ if vê and v/ 
are different and adjacent in G. Therefore, {v!,...,v*} induces a k-clique in 


G. 














One may analogously define the multicolored version of INDEPENDENT SET, 
where the k vertices of the independent set have to include one vertex from 
each class of the given partition (V;,...,V,). Taking the complement of the 
graph is a parameterized reduction between CLIQUE and INDEPENDENT SET. 
Observe that this also holds for the multicolored version. 


Corollary 13.8. There are parameterized reductions between MULTICOL- 
ORED CLIQUE on regular graphs and MULTICOLORED INDEPENDENT SET 
on regular graphs. 


Our next reduction gives evidence for the fixed-parameter intractability of 
DOMINATING SET. 


Theorem 13.9. There is a parameterized reduction from MULTICOLORED 
INDEPENDENT SET to DOMINATING SET. 


Proof. Let (G,k,(Vi,...,Ve)) be an instance of MULTICOLORED INDEPEN- 
DENT SET. We construct a graph G” the following way (see Fig. {13.1). 


(i) For every vertex v € V(G), we introduce v into G’. 
(ii) For every 1 <i < k, we make the set V; a clique in G’. 
(iii) For every 1 < i < k, we introduce two new vertices x;,y; into G” and 
make them adjacent to every vertex of Vj. 
(iv) For every edge e € E(G) with endpoints u € V; and v € V}, we introduce 
a vertex we into G” and make it adjacent to every vertex of (V; UV;) \ 


{u,v}. 


The intuitive idea of the reduction is the following. The vertices x; and 
yi ensure that a dominating set of size k has to select exactly one vertex 
from each V;. The vertices we ensure that the selected vertices form an 
independent set in the original graph: if both endpoints of an edge e 
are selected, then the vertex we is not dominated. 
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Fig. 13.1: The reduction from MULTICOLORED INDEPENDENT SET to DOM- 
INATING SET in Theorem [13.9] The set V; U {x;, yi} induces a clique minus 
the edge xiyi. If edge e connectes u € V; and v € V}, then we introduce a 
vertex we adjacent to (V; UV;) \ {u,v} 


Formally, we claim that G has an independent set with exactly one vertex 
from each V; if and only if G” has a dominating set of size k. Suppose that 
there is an independent set J in G with one vertex from each V;; we show 
that it is a dominating set in G”. First, as INV; Æ 0, the set J dominates the 
set V; U {x;, yi}. Consider now a vertex we, where u € V; and v € V; are the 
endpoints of edge e € E(G). As u and v are adjacent, at least one of them 
is not in I. In other words, I contains a vertex of either V; \ {u} or Vj \ {v}, 
that is, 7 contains a vertex of (V; UV;) \ {u,v}, which dominates we. 

To prove the reverse direction of the equivalence, suppose now that D is a 
dominating set of size k in G”. To dominate vertices x; and y;, the set D has 
to contain at least one vertex from V; U {z;, yi}. As these sets are disjoint for 
different values of 7, we can assume that D contains exactly one vertex from 
each of V; U {a;, yi}. As 2; and y; are not adjacent, this vertex of D cannot 
be x; or yi. Therefore, let us assume that the dominating set D consists of 
the vertices vı € Vi, ..., Uk E Ve. We claim that D is an independent set in 
G. Suppose that v; and v; are the endpoints of an edge e. Then vertex we 
of G’ is adjacent only to (V; UV;) \ {u;, vj}, and hence D does not dominate 
We, a contradiction. 














As there is a very simple reduction between (multicolored versions of ) CLIQUE 
and INDEPENDENT SET, a reduction from one can be reinterpreted to be a 
reduction from the other with minimal changes. However, sometimes choosing 
one of the two problems is more convenient notationally or conceptually; for 
example, in the proof of Theorem [13.9] it is convenient notationally that (iv) 
adds a new vertex we for every edge e (rather than for every nonedge). 
Given a set system F over a universe U and an integer k, the SET COVER 
problem asks if there are k sets in F such that their union is U. In this 
chapter, we always parameterize SET COVER by the number k of selected 
sets in the solution. The following easy reduction shows that SET COVER 
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is as hard as DOMINATING SET (Exercise asks for a reduction in the 
reverse direction). 


Theorem 13.10. There is a parameterized reduction from DOMINATING SET 
to SET COVER. 


Proof. Let (G,k) be an instance of DOMINATING SET. We create an instance 
(F,U, k) of SET Cover as follows. We let U := V (G) and for every v € V(G), 
we introduce the set Ng[v] (the closed neighborhood of v) into F. Suppose 
that D is a dominating of size k in G. Then the union of the corresponding 
k sets of F covers U: an uncovered element would correspond to a vertex of 
G not dominated by D. Conversely, if the union of k sets in F is U, then the 
corresponding k vertices of G dominate every vertex: a vertex not dominated 
in G would correspond to an element of U not covered by the k sets. 














On directed graphs, we define dominating set in the following way: a set 
D C V(G) is a dominating set of G if for every vertex v € V(G) \ D, there is 
a vertex u € D such that edge (u,v) is in G. Equivalently, we can say that 
we want the union of the closed outneighborhoods to cover every vertex. It is 
easy to reduce DOMINATING SET on undirected graphs to DOMINATING SET 
on directed graphs: we can replace each undirected edge xy with two directed 
edges (x,y) and (y, x) (in other words, we replace xy with a bidirected edge 
between x and y). Furthermore, the reduction from DOMINATING SET to 
SET COVER appearing in Theorem [13.10|can be adapted to directed graphs. 
Therefore, DOMINATING SET on undirected graphs, DOMINATING SET on 
directed graphs, and SET COVER are reducible to each other. 

A very interesting special case of DOMINATING SET on directed graphs is 
when the input graph is a tournament, that is, for every pair u,v of distinct 
vertices, exactly one of (u,v) and (v, u) is in the graph. It is easy to see that 
every tournament on n vertices has a dominating set of size O(log n). 


Lemma 13.11. Every tournament on n vertices has a dominating set of size 
at most 1 + logn. 


Proof. We prove the claim by induction on n. For n = 1 it is trivial, so let 
us assume n > 2. 


Since T has (5) edges, the sum of outdegrees of all the vertices is equal 


=i ae . 
io) = nent) As there are n vertices in T, there exists some vertex u 


with outdegree at least r=. Let U+ be the set of outneighbors of u, and 


let UT be the set of inneighbors of u. Then Ut and U~ form a partition of 
V(T) \ {u}, and |U~| < 25+ < [8]. Since |2] <n, by induction hypothesis 
we may find a set D’ C U~ that is a dominating set in T[U~] and has at 
most 1 +log| 4] < logn vertices. Then D := D'U {u} is a dominating set in 


T, and has size at most 1 + logn. 














Therefore, DOMINATING SET ON TOURNAMENTS can be solved by brute 
force in time n0087), making it very unlikely that it is NP-hard. Is there 
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such a spectacular collapse of complexity compared to general directed graphs 
also for the parameterized version of the problem? Surprisingly, we are able 
to show that this is not true: there is a parameterized reduction from SET 
COVER to DOMINATING SET ON TOURNAMENTS. 

The crucial building block in our reduction will be a construction of suffi- 
ciently small tournaments that do not admit a dominating set of size k. Such 
tournaments are traditionally called k-paradozical, and were studied inten- 
sively in combinatorics. Note that by Lemma13.11] a k-paradoxical tourna- 
ment must have at least 2% vertices. It turns out that once we increase the size 
of the vertex set slightly over this threshold, the k-paradoxical tournaments 
start to appear. 


Theorem 13.12. For every n > rp = 2-2" . k? there exists a tournament on 
n vertices that does not admit a dominating set of size k. 


The proof of Theorem due to Erdés, is a classic application of the 
probabilistic method: a randomly sampled tournament on at least rz, vertices 
does not admit a dominating set of size k with positive probability. The reader 
is asked to verify this fact in Exercise [13.4] We remark that the multiplicative 
constant 2 in the function rẹ can be in fact substituted with In 2 + o;,(1). 

In our reduction we need to construct a k-paradoxical tournament, and 
hence it seems necessary to have a constructive, deterministic version of The- 
orem Apparently, there exist deterministic constructions that achieve 
similar asymptotic size of the vertex set as Theorem [13.12] in Exercise 
the reader is asked to work out details of a simple deterministic construction 
with slightly worse bounds. We can, however, circumvent the need of a deter- 
ministic construction at the cost of allowing the construction to run in doubly 
exponential time in terms of k. The running time will be still fixed-parameter 
tractable, which is sufficient for our purposes. 


Lemma 13.13. There exists an algorithm that, given an integer k, works in 


time CF) gr -kO and outputs a tournament Ty that has rp vertices and 
does not admit a dominating set of size k. 

Proof. We construct Tk by iterating through all the 2(¥) tournaments on Tk 
vertices, and for each of them verifying in time oo -kO) whether it is k- 
paradoxical by checking all the k-tuples of vertices. Theorem[13.12|guarantees 
that one of the verified tournaments will be in fact k-paradoxical. 














We are finally ready to provide the reduction. 


Theorem 13.14. There is a parameterized reduction from SET COVER to 
DOMINATING SET ON TOURNAMENTS. 


Proof. The reduction starts with an instance (F,U,k) of SET COVER, and 
outputs an equivalent instance (T,k +1) of DOMINATING SET ON TOURNA- 
MENTS. The first step is a construction of a (k + 1)-paradoxical tournament 
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Vu Vw 


VF 





Fig. 13.2: A schematic view of the construction of Theorem for a uni- 
verse of size 6, family of size 5, and S being a cyclic tournament on three 
vertices. Edges adjacent to the sets Ve, and Vw; represent appropriately ori- 
ented complete bipartite graphs, and edges created in (i) between Vr and Vy 
have been depicted only for the set X3 = {e1, €2, es} 


S = Tr41 on rk4ı vertices using Lemma {13.13} this contributes a factor 
r(k+1) 
2") r(k+1)*+?.(k4+1)° to the running time, which is just a function 
of k. 
The vertex set of the constructed tournament T is defined as follows: 


(i) For every e € U, create a set of r41 vertices Ve = {Vew : we V(S)}, 
one for each vertex of S. Let Vi = {Vew : e € U}, and let Vy = 
Uccu Ve = Uwevis) Vw. 

(ii) For every X € F, create one vertex vy. Let Vr = {vx : XEF}. 

(iii) Moreover, create one vertex v*. 


We now create the edge set of T. 


(i) For every set X € F and every element e € U, if e € X then introduce 
an edge from vx to every vertex of V., and if e ¢ X then introduce an 
edge from every vertex of V to vx. 

(ii) For every set X € F, introduce an edge (v*, vx). 

(iii) For every element e € X and w € V(S), introduce an edge (vew, vu"). 

(iv) For every w1, w2 E€ V(S) with w 4 we, introduce an edge from every 
vertex of Vu, to every vertex of Vw, if (wi,w2) E€ E(S), and introduce 
the reverse edges if (w2, w1) E€ E(S). 

(v) For every w € V(S), put edges between vertices of V,, arbitrarily. 

(vi) Finally, put the edges between vertices of Vz arbitrarily. 


It is easy to see that the constructed digraph T is indeed a tournament, and 
that the construction can be performed in time polynomial in |U]|, |F|, and 
rp+i (provided S is already constructed). We now claim that T admits a 
dominating set of size k + 1 if and only if the input instance (F,U,k) is a 
yes-instance. 
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Assume first that G C F is a subfamily of size at most k such that UG = U. 
Consider D = {v*} U {vx : X € G}. Clearly |D| < k +1, and observe that 
D is a dominating set of T: each vertex of Vz is dominated by v*, while each 
vertex Vew E Vy is dominated by a vertex vx € D for X € G such that 
ecx. 

Conversely, suppose that T admits a dominating set D such that |D| < 
k+1. Since D has to dominate v*, either D contains v* or at least one vertex 
of Vy. Consequently, |D N V| < k. Let G = {X € F : vx € D}. Clearly 
|G| < k, so it suffices to prove that UG = U. 

For the sake of contradiction assume that there exists some eg € U that 
does not belong to any set of G. Let Z C V(S) be the set of all vertices 
z € V(S) such that DN V; 4. Since |D| < k +1, we also have |Z| <k +1. 
Since S is (k + 1)-paradoxical, we have that there exists some vertex wo € 
V(S) that is not dominated by Z in S. Consider now the vertex ve... in 
T. By the definition of the edges between Vz and Vy, introduced in (i), we 
have that vej,w, is not dominated by D N Vz, since then eg would belong 
to UG by the definition of G. Since Z does not dominate wo in S, by the 
definition of the edges introduced in (iv) it also follows that DM Vy does not 
dominate v.,.~,- Note that in particular wo ¢ Z, so DN Vy, = 0 and ve, wo 
cannot be dominated from within V,,,. Finally, veo,w) cannot be dominated 
by D N {v*}, since v* does not have any outneighbor in Vy. We infer that 
Veo,wo iS not dominated by D at all, which contradicts the assumption that 
D is a dominating set in T. 














Note that, unlike most parameterized reductions in this book and also 
in the literature, the reduction in Theorem [13.14] is not a polynomial-time 
reduction: the size of the constructed instance has exponential dependence 
on k and the running time depends double exponentially on k. Therefore, 
this reduction crucially uses the fact that property 3 in Definition [13.1|allows 
f(k) |a|\O™ time instead of just |a|O™. 

CONNECTED DOMINATING SET is the variant of DOMINATING SET where 
we additionally require that the dominating set induce a connected graph. 
Not surprisingly, this version of the problem is also hard. 


Theorem 13.15. There is a parameterized reduction from DOMINATING SET 
to CONNECTED DOMINATING SET. 


Proof. Let (G,k) be an instance of DOMINATING SET. We construct a graph 
G’ the following way. 


(i) For every vertex v € V(G), we introduce two adjacent vertices v1, v2. 
(ii) We make the set {v; : v € V(G)} a clique K of size |V(G)]. 
(iii) We make vı and u2 adjacent if v and u are adjacent in G. 


We claim that (G,k) is a yes-instance of DOMINATING SET if and only if 
(G’,k) is a yes-instance of CONNECTED DOMINATING SET. Suppose first 
that S = {v!,...,v*} is a dominating set of size k in G. Then we claim that 
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S’ ={ovt,..., vf} is a connected dominating set of size k in G”. Clearly, G’[S”] 
is a clique and hence it is connected. To see that S’ is a dominating set in 
G’, observe that vj dominates K, and if u is dominated by v’ in G, then uz 
is dominated by vý in G”. 

For the proof of the reverse direction of the equivalence, let S’ be a con- 
nected dominating set of size k in G”. Let v be in S if at least one of vı and 
və is in S’; clearly, |S| < |S’| = k. We claim that S is a dominating set of G. 
Consider any vertex u € V(G). Vertex uz of G’ is dominated by some vertex 
vı or vg that belongs to S’. Then v is in S and, by the construction of G”, it 
dominates u in G, as required. 














13.3 The W-hierarchy 


As we have discussed above, most of the natural NP-hard problems are equiv- 
alent to each other with respect to polynomial-time reductions (this holds for 
problems that are NP-complete), but this does not seem to be true for hard 
parameterized problems. For example, we have shown that (MULTICOLORED) 
INDEPENDENT SET can be reduced to DOMINATING SET (Theorem|13.9), but 
it is not known if there is a parameterized reduction in the other direction. 
This suggests that, unlike in the case of NP-complete problems, there is a hier- 
archy of hard parameterized problems, with, for example, INDEPENDENT SET 
and DOMINATING SET occupying different levels of this hierarchy. Downey 
and Fellows introduced the W-hierarchy in an attempt to capture the exact 
complexity of various hard parameterized problems. In this section, we briefly 
overview the fundamental definitions and results related to this hierarchy. It 
has to be noted, however, that if our only concern is to provide some evidence 
that a specific problem is not fixed-parameter tractable (which is the usual 
viewpoint of an algorithm designer), then the exact structure of this hierar- 
chy is irrelevant: the reductions from CLIQUE, as presented in Section [13.2] 
and appearing later in this chapter, already provide practical evidence that 
a problem is unlikely to be FPT. Nevertheless, the reader should be aware 
of the basic terminology of the W-hierarchy, as the parameterized algorithms 
and complexity literature usually assumes its knowledge. 


A Boolean circuit is a directed acyclic graph where the nodes are labeled 
in the following way: 


e every node of indegree 0 is an input node, 
e every node of indegree 1 is a negation node, 
e every node of indegree > 2 is either an and-node or an or-node. 


Additionally, exactly one of the nodes with outdegree 0 is labeled as the 
output node (in addition to being, for example, an and-node). The depth 
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Fig. 13.3: (a) A graph G with five vertices and seven edges. (b) A depth-3, 
weft-1 circuit satisfied by the independent sets of G. Each or-node corresponds 
to an edge of G and has indegree 2. (c) A depth-2, weft-2 circuit satisfied by 
the dominating sets of G. Each or-node corresponds to a vertex of G and its 
inneighbors correspond to the closed neighborhood to the vertex 


of the circuit is the maximum length of a path from an input node to 
the output node. 


Assigning 0-1 values to the input nodes determines the value of every node 
in the obvious way. In particular, if the value of the output gate is 1 in an 
assignment to the input nodes, then we say that the assignment satisfies the 
circuit. It can be easily checked in polynomial time if a given assignment 
satisfies the circuit. 

Deciding if a circuit has a satisfying assignment is clearly an NP-complete 
problem: for example, 3-SAT is its special case. We can define a parame- 
terized version of finding a satisfying assignment in the following way. The 
weight of an assignment is the number of input gates receiving value 1. In 
the WEIGHTED CIRCUIT SATISFIABILITY (WCS) problem, we are given a 
circuit C and an integer k, the task is to decide if C has a satisfying as- 
signment of weight ezactly k. By trying all the O(n") assignments of weight 
exactly k and then checking whether it satisfies C, we can solve the problem 
in polynomial time for every fixed k. However, the problem does not seem 
to be fixed-parameter tractable: it is quite easy to reduce hard parameter- 
ized problems such as CLIQUE or DOMINATING SET to WEIGHTED CIRCUIT 
SATISFIABILITY (see Fig. [13.3). 

The levels of the W-hierarchy are defined by restricting WEIGHTED CIR- 
CUIT SATISFIABILITY to various classes of circuits. Formally, if C is a class of 
circuits, then we define WCS[C] to be the restriction of the problem where 
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the input circuit C belongs to C. It seems to be a natural idea to restrict 
the depth of the circuits in the class C; there is a large body of literature on 
the expressive power of bounded-depth circuits. However, the restriction we 
need here is a bit more subtle than that. First, we distinguish between small 
nodes, which have indegree at most 2, and large nodes, which have indegree 
> 2 (as we shall see later, the choice of the constant 2 is not essential here). 
The weft | of a circuit is the maximum number of large nodes on a path from 
an input node to the output node. We denote by C;,q the class of circuits with 
weft at most t and depth at most d. 


Definition 13.16 (W-hierarchy). For t > 1, a parameterized problem P 
belongs to the class Wft] if there is a parameterized reduction from P to 
WCS[C: a] for some d > 1. 


Exercise [13.6]shows that the constant 2 in the definition of small/large nodes 
is not essential: any other constant > 2 would result in the same definition 
for Wft]. 

As indicated in Fig. it is possible to reduce INDEPENDENT SET to 
WEIGHTED CIRCUIT SATISFIABILITY for circuits of depth 3 and weft 1. By 
Definition [13.16] membership in W[1] follows. 


Proposition 13.17. INDEPENDENT SET is in W[1]. 


One can also show the converse statement: every problem in W[1] can be 
reduced to INDEPENDENT SET, that is, the problem is W[1]-hard. The proof 
of this statement is nontrivial and beyond the scope of this brief introduction. 
Together with Proposition [13.17] it follows that INDEPENDENT SET is W[1]- 
complete. 


Theorem 13.18 ([151]). INDEPENDENT SET is W[1]-complete. 


The circuit in Fig. expressing INDEPENDENT SET has a very specific 
form: it consists of one layer of negation nodes, one layer of small or-nodes 
with indegree 2, and a final layer consisting of a single large and-node. Theo- 
rem [13.18]shows that this form is in some sense canonical for weft-1 circuits: 
the satisfiability problem for (bounded-depth) weft-1 circuits can be reduced 
to weft-1 circuits of this form. 

What would be the corresponding canonical forms for weft-t circuits for 
t > 2? We need some definitions first. We say that a Boolean formula is t- 
normalized if it is of the following form: it is the conjunction of disjunctions 
of conjunctions of disjunctions, ..., alternating for t levels. For example, the 
formula 

(a1 V Z3 V Z5) A (T1 V £2 V T4 V £5) A (x1 V T2) 


is 2-normalized, as it is the conjunction of disjunctions of literals. More for- 
mally, we first define both Ap and Tọ to be the set of formulas consisting of 





3 Weft is a term related to weaving cloth: it is the thread that runs from side to side in 
the fabric. 
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only a single literal. Then for t > 1, we define A; to contain formulas that are 
the disjunction of an arbitrary number of [;_, formulas and I; to contain 
formulas that are the conjunction of an arbitrary number of A;—; formulas. 
With these definitions, I, is exactly the set of tnormalized formulas. We say 
that a ¢-normalized formula is monotone (resp., antimonotone) if every literal 
is positive (resp., negated). 

The WEIGHTED t-NORMALIZED SATISFIABILITY problem is the weighted 
satisfiability problem corresponding to circuits representing t-normalized for- 
mulas; the monotone/antimonotone versions of the problem are defined in 
the obvious way. The following theorem, whose proof again goes beyond the 
scope of this introduction, characterizes the classes Wt] for t > 2 in terms of 
these problems. 


Theorem 13.19 ([149} {150} [151]). 
1. For every even t > 2, the following problems are W[t|-complete: 


e WEIGHTED t-NORMALIZED SATISFIABILITY 
e WEIGHTED MONOTONE t-NORMALIZED SATISFIABILITY 
e WEIGHTED MONOTONE (t + 1)-NORMALIZED SATISFIABILITY 


2. For every odd t > 3, the following problems are W{t|-complete: 


e WEIGHTED t-NORMALIZED SATISFIABILITY 
e WEIGHTED ANTIMONOTONE t-NORMALIZED SATISFIABILITY 
e WEIGHTED ANTIMONOTONE (t + 1)-NORMALIZED SATISFIABILITY 


Theorem has three rather surprising aspects. First, it shows that t- 
normalized formulas are sufficient to express the full power of (bounded- 
depth) weft-t circuits. These formulas have very specific structures: besides 
the negation nodes at the inputs, they consist of t layers of large nodes, al- 
ternating between and-nodes and or-nodes. One can interpret Theorem [13.19] 
as saying that additional layers of small nodes do not increase the expressive 
power, as t-normalized formulas already give rise to W|t|-complete problems. 
This justifies why we defined Wft] based on weft instead of depth. Second, 
Theorem states that already monotone ¢-normalized formulas (for even 
t) and antimonotone t-normalized formulas (for odd t) are sufficient to cap- 
ture the full power of W{¢]. Finally, Theorem also states the surprising 
fact that the power of monotone/antimonotone ¢t-normalized and (t + 1)- 
normalized formulas coincide for certain values of t. 

Let us focus now on W[2], the second level of the hierarchy. Figure [13.3] 
shows that it is possible to reduce DOMINATING SET to WEIGHTED CIRCUIT 
SATISFIABILITY for circuits of depth 2 and weft 2; hence membership in W[2] 
follows. We have seen in Section [13.2]that DOMINATING SET and SET COVER 
(and hence HITTING SET) are equally hard with respect to parameterized 


reductions (Theorem |13.10|and Exercise |13.2). 


Proposition 13.20. DOMINATING SET, SET COVER, and HITTING SET are 
in W([2]. 
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We observe now that these problems are complete for W[2]. All we have 
to note is that HITTING SET is equivalent to WEIGHTED MONOTONE 2- 
NORMALIZED SATISFIABILITY. A 2-monotone normalized formula is of the 
form 

(xı V T3 V ze) A (£2 V x3) A (xı V £5 V ze). 


To satisfy such a formula, we have to set at least one variable x; to 1 in each 
clause. This is exactly the HITTING SET problem, where the universe is the 
set of variables and the sets are the clauses. Thus Theorem [13.19]shows that 
HITTING SET is W[2]-complete. 


Theorem 13.21. DOMINATING SET, SET COVER, and HITTING SET are 
W ([2]-complete. 


The usual expectation is that the W-hierarchy is a proper hierarchy: W[t] 4 
W(t + 1] for every t > 1. In Theorem we have seen a reduction from 
INDEPENDENT SET to DOMINATING SET. In light of Theorem [13.21] we do 
not expect a reduction in the reverse direction: this would imply that IN- 
DEPENDENT SET is both W[1]- and W[2]-complete and hence W[1] = W([2] 
would follow. 


*13.4 Turing machines 


CLIQUE and INDEPENDENT SET are among the most studied optimization 
problems in computer science; hence the fact that no algorithm with run- 
ning time n°“) was found despite massive efforts is good practical evidence 
suggesting that these problems are not fixed-parameter tractable. The reduc- 
tions in Section [13.2] (and those appearing later in Section [13.6) transfer this 
apparent intractability to several other problems. In this section, we give a 
more theoretical supporting evidence for the intractability of these problems. 

Turing machines formalize the intuitive notion of computation in a precise 
mathematical way. Computational problems where the task is to predict what 
a Turing machine will do are typically very hard: as Turing machines can 
express arbitrarily complex computations, it seems that one needs to simulate 
the computation of the Turing machine to answer these questions. In the case 
of the HALTING problem, there is even a formal proof of hardness: a classic 
diagonalization argument shows that there is no algorithm deciding whether 
a Turing machine halts in a finite number of steps. The P 4 NP problem is 
a time-bounded version of this question: one has to decide in deterministic 
polynomial time whether a nondeterministic polynomial-time Turing machine 
has an accepting path. While there is no proof that this is not possible, it 
seems that a nondeterministic Turing machine is such an opaque and generic 
object that it is unlikely that one can avoid simulating a superpolynomial 
number of computation paths. 
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We define a parameterized problem analogously to the acceptance prob- 
lem of nondeterministic polynomial-time Turing machines, and use it to give 
further evidence that CLIQUE is not fixed-parameter tractable. In the SHORT 
TURING MACHINE ACCEPTANCE problem, the input contains the description 
of a nondeterministic Turing machine M, a string x, and an integer k; the 
task is to decide if M with input x has a computation path reaching an ac- 
cepting state in at most k steps. If the reader sees this problem for the first 
time, it might appear that simulating a Turing machine for a small number k 
of steps is an easy task, but let us point out that a nondeterministic Turing 
machine may branch into an unbounded number of states (bounded only by 
the length n of the input), hence naive simulation of k steps would lead to 
an nC) time algorithm. Therefore, the same intuitive argument suggesting 
P Æ NP also supports the conjecture that SHORT TURING MACHINE AC- 
CEPTANCE is not fixed-parameter tractable: it is unlikely that a problem that 
requires understanding a Turing machine with n°) computation paths is 
FPT. 

We present a parameterized reduction from SHORT TURING MACHINE 
ACCEPTANCE to INDEPENDENT SET in this section. This can be considered 
as evidence for the fact that INDEPENDENT SET (as well as all the other 
problems considered in Section [13.2} are not fixed-parameter tractable. The 
reduction is fairly tedious, but the principle is simple. An accepting compu- 
tation path of k steps can be fully described by O(k?) pieces of information: 
a sequence of k states, a sequence of k transitions, the content of the first k 
cells of the tape of a Turing machine in each of the first k steps. We need 
to construct a graph where the choice of vertices for an independent set of 
size O(k?) represents all this information, and then make sure that this infor- 
mation is consistent, that is, correctly describes a sequence of configurations 
forming an accepting computation path. 

We assume that the reader is familiar with the concept of Turing machines 
(see standard textbooks such as for more background). Here we 
review only the basic notation required for the discussion of Turing machines. 
Since we are using Turing machines as evidence for hardness, and not as a 
basis for building complexity theory, we avoid treating them in greater detail. 
We remind the reader that certain details and conventions in the definition 
(such as whether the tape is infinite in both directions, whether the machine 
can leave an accepting state, etc.) usually do not matter for the computational 
power of Turing machines and proofs can be changed accordingly. 


A single-tape nondeterministic Turing machine is described by a tuple 


M = (Q, X, A, so, F), where 


e Q is a finite set of states, 
e X is the alphabet, 
e so € Q is the initial state, 
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e F CQ is the set of accepting states, and 
e ACQx (LU {$,O}) x Q x (LU {$}) x {4, —, 3} is the transition 


relation. 














The meaning of the transition relation is the following: if, for example, 
(q,a,q',b,<-) € A, then this means that whenever the Turing machine is in 
state q and symbol a appears under the head, then one possible legal step 
is to move into state gq’, write b on the tape, and move the head one step to 
the left. The special symbol L represents empty cells of the tape and special 
symbol $ appears at the left end (position 0) of the tape. Initially, the tape 
contains an input word x followed by an infinite sequence of O symbols; the 
head is at position 1. We assume that whenever the Turing machine reads 
the symbol §, it does not change it (that is, writes $ on it) and does not move 
further to the left. 


























Theorem 13.22. There is a parameterized reduction from SHORT TURING 
MACHINE ACCEPTANCE to INDEPENDENT SET. 


Proof. Let (M,x,k) be an instance of SHORT TURING MACHINE ACCEP- 
TANCE. We construct an equivalent instance (G,k’) of INDEPENDENT SET 
in the following way. By minimal modifications of the Turing machine M, 
we may assume that whenever it reaches an accepting state, it stays there. 
Therefore, we may assume that the computation path is infinite and the ques- 
tion is whether M happens to be in an accepting state after step k. Note that 
during the first k steps, the Turing machine can visit only positions 0, 1, ..., 
k + 1 of the tape; hence it is sufficient to “simulate” what happens on these 
positions. The vertices of G are 











© aijo for every OS i<k,0O<j<kt+1,o€ YU {$,0} and 
e bija for every 0<i<k,0<j<k+1, ôE A. 





The intuitive meaning of ai jo being in the independent set is that after 
step i, the symbol at position j of the tape is ø and the head is not on 
this position. The intuitive meaning of b; j being in the independent set is 
that after step i, the head is at position j, and the next transition is 6; in 
particular, if, say, 6 = (q,0,q',0',<-) € A, then this means that after step 
i, the symbol at position j of the tape is o and the internal state is q. Let 
Aij = {aij : oE LYU{$,O}}, let Bij 4 contain every b; j where ô € A 
is a transition such that the first coordinate of 6 is q and the second coordinate 
of 6 is ø. Let Bija, = Usesuts, } Bija,» let Bi j,_,o = Uco Bij.q,o5 and 
let Bij = Useq Usesurs, y Bi jao, Let k! = (k + 1)(k + 2). 

We remove some of the vertices of the graph to enforce certain boundary 
conditions: 









































(i) Let us remove Ap 1U(Bo,1 \ Bo,1,s9,0), where ø is the first symbol under 
the head in the initial position. This ensures that a vertex bo1,5 € 
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Bo,1,80,0 Of the independent set correctly describes the initial state of 
the Turing machine. 

For every 2 < j < k + 1, let us remove (Ao,; \ {@0,3,0}) U Boj, where o 
is the j-th symbol of the tape in the initial configuration. This ensures 
that the initial state of the input tape is correctly represented by the 
independent set. 

For every 0 <i < k, let us remove (Ajo \ {a;,0,$}) U (Bio \ Bio, $)- 
This ensures that the symbol at position 0 of the tape is always $. 
For every 0 < j < k+1, let us remove Bk jq, for every q ¢ F. 
This ensures that the vertex bp, js representing the state of the Turing 
machine after step k represents an accepting state. 


To enforce the interpretation of the vertices described above, we add edges 
in the following way. 


(i) 


(iv) 


For every 0 <i < k, 0 < j < k+1, we make A; U Bij a clique. 
This ensures that the only way to have an independent set of size k’ = 
(k + 1)(k + 2) is to select exactly one vertex from each A; j U B;,;. 

For every 0 < i < k, we make eS B; j a clique. This ensures that, for 
every i, at most one vertex of the independent set can be from some 
B; j; the rest has to be from the A; ,’s, corresponding to the fact that 
the head of the Turing machine is at one position after step i. 

For every 0<i<k,0<j<k+4+1,andoe¢ YU {$,O}, we make aijo 
adjacent to every vertex of (Aisi; \ {4i+41,j,0}) U (Bit; \ Bis1,3,_,0)- 
This ensures that if the head of the Turing machine is not at position 
j after step i, then the same symbol o appears at position j also after 
step i+ 1. That is, the vertex selected from Aj+1,; U Bi+1,; should be 
either aj41,;,0 or a vertex from Bi+1,j, o- 

For every 0<i<k,0<j<k+1, and 6 € A, we add edges to b; j, in 
the following way: 























e if ô = (g,0,q',0',<), then bij is adjacent to every vertex of 
Ajyij—1U (Bit1,j-1 \ Biti,j-1,q',_)U (Aiti,3 \ {@i+1,3,0°}) (note that 
such a left move is not possible for j = 0, since then b; jg € Bij, $ 
and no left move is defined on symbol $. This ensures that if b; ;,5 
is in the independent set, then the independent set represents that 
after step 7+ 1, the head is at position 7 — 1, the state is q’, and the 
symbol at position j is o’. 

e if ô= (q,0,q',0’,—), then b;,;,5 is adjacent to every vertex of Aj+1,;U 
(Bi 41,5 \ Bit1,j,q',0’): 

e ifd =(q,0,¢q',0',—), then b; js is adjacent to every vertex of (Aj+1,;\ 
{aita,j,o/}) U Aisiyj4a U (Bitij41 \ Bitajtie’,_)- 


As every };,;,5 is fully connected to one of Aj41,j;-1, Ai+1,j, and Aiti,j44, 
these edges ensure that if some b; j, is in the independent set, then a 
vertex of Bi+1,j—1 U Bi41,; U Bi+1,j+1 is also in the independent set, 
that is, a position of the head and a state is defined after step i + 1 
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as well. Moreover, this new state and position are the result of a valid 
transition 6 € A. 


We claim that M can be in an accepting state on input x after k steps if and 
only if G has an independent set of size k’. For the first direction, if M has an 
accepting path, then we can select k’ vertices according to the interpretation 
of the vertices ai j, bi j described above. Then it can be verified that this 
set remains independent after adding the edges introduced in (i)-(iv). 

For the other direction, suppose that G has an independent set I of size k’. 
This is only possible if it contains exactly one vertex from A; jU B; j for every 
O0<i<k,0< j< k+1. We claim that, for every 0 < i < k, the set I; of k+2 
vertices selected from A; oU B;,0, ..., Ai,n41U Bi,k+1 describe a configuration 
C; of the Turing machine, and these k + 1 configurations together describe 
an accepting computation path. The way we removed vertices ensures the 
statement for i = 0, and in fact the configuration Co corresponding to Jo is the 
initial state of the Turing machine and the tape. Let us prove the statement 
by induction on i: suppose that the configurations Co, ..., Ci describe the 
first i steps of a computation path. Then the way we have added edges ensure 
that [;,1 describes a configuration Ci+ı that can follow C;. In particular, as 
Bisi U +++ U Biz1,n41 is a clique, at most one vertex of [;;; can describe 
the position of the head. One vertex of I; describes the position of the head 
after step i, that is, I; contains a vertex of B; j for some 0 < j <k+1. Then 
the edges introduced in (iv) ensure that J;41 also contains at least one vertex 
describing the position of the head: every vertex of Bi; is fully connected 
to either Aj+4i,j-1, Ai+1,j, Or Ai+i,j41, which forces the selection of a vertex 
from Bj+41,;-1, Bi+1,;, or Bi+1,j+1, respectively. Therefore, exactly one vertex 
of [;41 describes the position of the head. This means that we can define the 
configuration Ci+ı based on J;,1, and it can be verified (by analysing the 
edges added in (iii)-(iv)) that it is indeed a configuration that can follow 
Ci. 














It is not difficult to give a reduction in the reverse direction: from INDE- 
PENDENT SET to SHORT TURING MACHINE ACCEPTANCE (Exercise [13.8), 
showing the W[1]-completeness of the latter problem. 


Theorem 13.23. SHORT TURING MACHINE ACCEPTANCE is W[1]-complete. 


*13.5 Problems complete for W[1] and W[2] 


In Section we have seen a set of problems that are at least as hard 
as CLIQUE or DOMINATING SET, and therefore W[1]-hard or W[2]-hard, re- 
spectively. Now we complete the picture by showing that those problems are 
actually W[1]-complete or W[2]-complete. 

CLIQUE and INDEPENDENT SET are both W{1]-complete (Theorem|13.18). 
We have reduced INDEPENDENT SET to PARTIAL VERTEX COVER in The- 
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orem showing that PARTIAL VERTEX COVER is W[1]-hard. To show 
membership in W[1], we can reduce PARTIAL VERTEX COVER to SHORT 
TURING MACHINE ACCEPTANCE. 


Theorem 13.24. There is a parameterized reduction from PARTIAL VERTEX 
COVER to SHORT TURING MACHINE ACCEPTANCE. 


Proof (sketch). The idea is the following. We encode the input instance to 
PARTIAL VERTEX COVER using the states and transitions of the output Tur- 
ing machine. Moreover, we take a large alphabet that includes a distinct 
symbol for each vertex of the input graph. In the first k steps of the Turing 
machine, a set S of k vertices is guessed and written on the tape. The re- 
maining steps are deterministic. The Turing machine checks whether the k 
vertices are distinct and counts the total degree of the k vertices. The total 
degree can be more than the number of edges covered by S, as edges with 
both endpoints in S are counted twice. Therefore, the Turing machine counts 
the number of edges induced by S: it reads every pair of vertices from the 
tape, keeps it in the internal state and increases the counter if they are adja- 
cent. In particular, the input graph is encoded by the transition function: for 
every state representing a pair of vertices and the value of the counter, the 
transition function defines a new state where the counter is increased by 1 (if 
the two vertices are adjacent) or has the same value (if the two vertices are 
nonadjacent). Finally, the Turing machine accepts if the total degree minus 
the number of induced edges is at least s. It is not difficult to implement this 
procedure with a Turing machine that takes f(k) steps; the details are left 


to the reader (Exercise |13.10). 


An instance (G, k, (Vi,..., Vk)) of MULTICOLORED CLIQUE can be easily 
reduced to CLIQUE: all we need to do is remove those edges whose both end- 
points are in the same V;. Then any k-clique has exactly one vertex from each 
Vi. Therefore, MULTICOLORED CLIQUE and MULTICOLORED INDEPENDENT 
SET are both W[1]-complete. 














Theorem 13.25. The following parameterized problems are W[1|-complete: 


CLIQUE 

MULTICOLORED CLIQUE 

INDEPENDENT SET 

MULTICOLORED INDEPENDENT SET 
PARTIAL VERTEX COVER 

SHORT TURING MACHINE ACCEPTANCE 


We have seen in Theorem [13.21]that DOMINATING SET, SET Cover, and 
HITTING SET are W[2]-complete. Theorem [13.15] presented a reduction from 
DOMINATING SET to CONNECTED DOMINATING SET, hence CONNECTED 
DOMINATING SET is W[2]-hard. Let us try to show a reduction in the reverse 
direction to prove that CONNECTED DOMINATING SET is W[2]-complete. For 
this purpose, we introduce an auxiliary problem first. The DOMINATING SET 
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WITH PATTERN is a version of DOMINATING SET where we prescribe what 
subgraph the dominating set should induce. Formally, the input is a graph G, 
an integer k, and a graph H on the vertex set [k]. The task is to find a k-tuple 
(v1,...,U%) of distinct vertices such that (1) they form a dominating set and 
(2) vertices v; and vj are adjacent if and only if ij is an edge of H. It is easy to 
observe that DOMINATING SET WITH PATTERN is W[2]-hard: the reduction 
of Theorem [13.15|from DOMINATING SET to CONNECTED DOMINATING SET 
has the property that if the created instance has a connected dominating set of 
size k, then it has a dominating set of size k inducing a clique (Exercise[13.12). 
We can show that DOMINATING SET WITH PATTERN is in W[2] by reducing 
it to DOMINATING SET. 


Theorem 13.26. There is a parameterized reduction from DOMINATING SET 
WITH PATTERN to DOMINATING SET. 


Proof. Let (G,k,H) be an instance of DOMINATING SET WITH PATTERN. 
We construct a graph G’ as follows. 


(i) For every vertex v € V(G), we introduce k vertices v1, ..., Ug. 

(ii) For every 1 < i < k, the set V; = {v; : v € V(G)} forms a clique. 

(iii) For every 1 < i < k, we introduce two vertices x;, Yi, and make them 
adjacent to every vertex of Vj. 

(iv) For every v € V(G), we introduce a vertex v* and make it adjacent to 
every vertex of {uj : u E€ Nafu], 1 < i< k}. 

(v) For every 1 < i < j < k such that ij is an edge of H and for every 
pair (a,b) of nonadjacent vertices in G (including the possibility a = b), 
we introduce a vertex Wi jap and make it adjacent to every vertex of 
(V; U Vj) \ {as b;}. 

(vi) For every 1 < i < j < k such that ij is not an edge of H and for 
every pair (a,b) such that either a and b are adjacent in G or a = b, 
we introduce a vertex Wi jap and make it adjacent to every vertex of 


(Vi U Vj) \ {ai, 6; }- 


Let (v',...,v") be a k-tuple of distinct vertices of G forming a dominating 
set such that v; and v; are adjacent if and only if ij is an edge of H. We 
claim that S’ = {v},...,vf£} is a dominating set of G’. Vertex v? dominates 
every vertex in V; U {x;, yi}. Consider a vertex v* introduced in (iv). There 
is a vertex v’ dominating v in G, hence vf dominates v* in G”. Consider now 
a vertex Wi j,a,b introduced in (v). As ij is an edge of H, vertices vê and v’ 
are adjacent. As either a = b or a and b are not adjacent, assertions vu =a 
and vf = b cannot be both true, and hence assertions vj = a; and v} = bj 
cannot be both true. This implies that (V; U V;) \ {a;, bj} contains at least 
one of vf and v, and we can conclude that wi ja, is dominated by S’. We 
argue similarly for a vertex w;,j,2,) introduced in (vi): now, as ij is not an 
edge of H, vertices vê and vÍ are not adjacent. As a and b are adjacent or 
a = b, we have again that (V; U V;) \ {a;, bj} contains at least one of v? and 
v, dominating Wi,j,a,b- 
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For the reverse direction, let S’ be a dominating set of G’. It has to contain 
at least one vertex from the set V; U {x;,y;} to dominate the vertices x; and 
yi. As these sets are disjoint for distinct values of i, we can conclude that. S’ 
contains exactly one vertex from V;U {a;, yi}; in fact, exactly one vertex from 
V;, as x; and y; do not dominate each other. Let 9’ = {vt,...,v%}, where 
vt € Vi. We claim first that S = {v!,...,v*} is a dominating set of G: if a 
vertex v € V(G) is not dominated by S, then vertex v* is not dominated by 
S’ in G’. Next we claim that if ij is an edge of H, then vê and wv’ are different 
and adjacent in G. Suppose that ij is an edge of H for some 1 < i < j < k. 
If vê and vf are equal or not adjacent, then we have introduced a vertex 
Wi,j,vtvi that is adjacent only to (V; U V;)\ {v}, uit, hence it is not dominated 
by S’ in G”, a contradiction. Similarly, we claim that if ij is not an edge of H, 
then vt and vÍ are two distinct nonadjacent vertices. Indeed, if vê and vÎ are 
adjacent or equal, then we have introduced a vertex W; j,i); that is adjacent 


only to (V;UV;) \ {vu}, vi}, hence not dominated by S”. In particular, the two 
claims imply that (v',...,v*) are distinct vertices, as any two of them are 
either connected by an edge or nonadjacent distinct vertices. Therefore, we 
get that (v',...,v*) is a k-tuple of distinct vertices forming a dominating set 


in G and inducing the required pattern. 














In the light of Theorem [13.26] we can show that CONNECTED DOMINAT- 
ING SET is in W[2] by reducing it to DOMINATING SET WITH PATTERN. A 
solution of CONNECTED DOMINATING SET induces a connected graph on k 
vertices and there are f(k) = 20°) connected graphs on k labelled vertices. 
Therefore, it is tempting to say that CONNECTED DOMINATING SET can be 
reduced to DOMINATING SET WITH PATTERN by trying every possible such 
pattern and applying an algorithm for DOMINATING SET WITH PATTERN. 
However, this is not a many-one parameterized reduction, as defined in Defi- 
nition [13.1] given an instance J of CONNECTED DOMINATING SET, instead of 
creating a single equivalent instance I’ of DOMINATING SET WITH PATTERN, 
we create a set Ij, ..., I; of instances of DOMINATING SET WITH PATTERN 
such that I is a yes-instance if and only if at least one I; is a yes-instance. 
This is an example of a so-called parameterized Turing reduction. Without 
formalizing the details, a parameterized Turing reduction from A to B is an 
algorithm that solves an instance of A by having access to an “oracle” that 
can solve instances of B in constant time. The reduction has the property 
that, given an instance (x, k) of A, the running time is f(k)|a|O“ and the 
parameter of every created instance of B can be bounded by f(k). 

Turing reductions transfer fixed-parameter tractability the same way as 
many-one reductions: one could state an analogue of Theorem saying 
that parameterized Turing reduction from A to B implies that if B is FPT, 
then A is FPT as well. Therefore, from the point of view of giving negative ev- 
idence for fixed-parameter tractability, Turing reductions are just as good as 
many-one reductions. However, we defined the W-hierarchy by closure under 
many-one reductions, thus to complete the picture, we need a many-one re- 
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duction to prove the membership of CONNECTED DOMINATING SET in W[2]. 
Tt is possible to modify the reduction of Theorem [{3.26]|to output a single 
instance of DOMINATING SET for a collection of patterns (Exercise (13.13). 
Alternatively, we can use the definition of W[2] directly and reduce Con- 
NECTED DOMINATING SET to the circuit satisfiability problem. 


Theorem 13.27. CONNECTED DOMINATING SET is in W[2]. 


Proof. Given an instance (G, k) of CONNECTED DOMINATING SET, we con- 
struct an equivalent instance (C, k) of WEIGHTED CIRCUIT SATISFIABILITY. 
Let v1, ..., Un be the vertices of G. The input nodes of the constructed circuit 
are zij for 1 <i < k, 1 <j <n. The interpretation of xij = 1 is that the 
i-th vertex of the solution is vj. 





(i) For 1 <i < k, node z; is the disjunction of {x;j : 1 < j < n} and 
O, is the conjunction of all the z; nodes. Then O = 1 expresses the 
requirement that there are integers s1, ..., Sk such that £1 s1, ---, Tk,s, 
are exactly the input nodes with value 1, describing a tuple (Vs, ..., Usp) 
of k vertices. 

(ii) For 1 < j < i, node wj expresses that vertex v; is dominated in the 
solution: it is the disjunction of the input nodes {x;y : 1< i< k,vj € 
N{vy]}. 

(iii) Node Og expresses that the input represents a dominating set: it is the 
conjunction of wj for every 1 < j < n. 

(iv) For every P such that Ø Ç P Ç [k], node cp expresses that there is an 
edge between {vs, : i € P} and {v,, : i ¢ P}. For every i; € P, 
ig € [k] \ P, and pair (vj , vja) of adjacent or equal vertices in G, we 
introduce a node ePi ,i2,j1,j2 that is the conjunction of xi, jı and Tis 
the node cp is the disjunction of all these nodes. 

(v) Node O; expresses that (vs,,...,Us,) induces a connected graph: it is 
the conjunction of all the nodes cp. 

(vi) The output node is the conjunction of O1, O2, and O3 (more precisely, 
there is a small node O’ that is the conjunction of O and Og, and the 
output node is a small node that is the conjunction of O’ and O3). 


„J29 


Observe that the constructed circuit C has constant depth (independent of 
k) and weft 2: a path from an input to the output contains either 


e a large node z; and the large node O1, 
e ora large node w; and the large node Oz, 
e ora large node cp and the large node O3. 


It is easy to see that if (vs,,...,Us,) is a connected dominating set, then 
setting %1,5,,---, Uk,s, to 1 satisfies the circuit. In particular, as (vs,,..., Usp) 
induces a connected graph, for every Ø € P Ç [k] there has to be an i; € P 
and ig € [k] \ P such that vs, and vs, are adjacent. This means that the 


node Citi Siis has value 1, implying that cp is 1 as well. 
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The reverse direction is also easy to show: it is clear that any satisfying as- 
signment describes a k-tuple (v;,,...,Us,) of vertices (possibly with repeated 
vertices) that forms a dominating set of the graph. If these vertices did not 
induce a connected graph, then they can be partitioned into two disconnected 
parts, that is, there is a Ø Ç P Ç [k] such that there is no edge between vj, 
and v;, for any i € P and i2 € [k] \ P. Then cp has value 0, implying that 
O3 has value 0, and hence the output has value 0 as well. 














We have shown in Section [3.2] that DOMINATING SET on undirected 
graphs, general directed graphs, tournaments, and SET COVER (or HITTING 
SET) are equally hard (see also Exercises (13.2). We summarize these results 
in the following theorem. 


Theorem 13.28. The following problems are W[2|-complete: 


DOMINATING SET 

DOMINATING SET for directed graphs 

DOMINATING SET ON TOURNAMENTS 

CONNECTED DOMINATING SET 

SET COVER 

HITTING SET 

WEIGHTED 2-NORMALIZED SATISFIABILITY 

WEIGHTED MONOTONE 2-NORMALIZED SATISFIABILITY 
WEIGHTED MONOTONE 3-NORMALIZED SATISFIABILITY 


13.6 Parameterized reductions: further examples 


In Section [13.2| we have presented reductions showing that basic problems 
such as DOMINATING SET are unlikely to be fixed-parameter tractable. The 
goal of this section is to explain, via further examples, the different types of 
parameterized reductions one can encounter in the parameterized complexity 
literature. 


13.6.1 Reductions keeping the structure of the graph 


The simplest type of reduction is when there is relatively simple correspon- 
dence between the input graph and the output graph. This is the case, for 
example, in the trivial reduction from CLIQUE to INDEPENDENT SET, or 
when reducing DOMINATING SET to CONNECTED DOMINATING SET (Theo- 
rem [13.15}. In other examples, the reduction may involve more complicated 
operations than just taking the complement of the graph (e.g., subdividing 
edges, replacing vertices/edges with certain fixed subgraphs, etc.), but the 
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Fig. 13.4: W{1]-complete and W/[2]-complete problems in Sections 
and the reductions between them. WCS[Mon 2-NorMm] is short for 
WEIGHTED MONOTONE 2-NORMALIZED SATISFIABILITY 


correspondence between the vertices /edges of the two graphs is fairly straight- 
forward. The following reduction is an example where the correspondence is 
still relatively clear, although not completely trivial. In the BALANCED VER- 
TEX SEPARATOR problem, we are given a graph G and integer k; the task is 
to find a set S of at most k vertices such that every component of G—S has 
at most |V(G)|/2 vertices. 


Theorem 13.29. BALANCED VERTEX SEPARATOR is W[1]-hard. 


Proof. We give a parameterized reduction from CLIQUE. Let (G,k) be an 
instance of CLIQUE. Let n = |V(G)| and m = |E(G)|. We may assume 
that n is sufficiently large compared to k, for example, n > 4k?: adding 4k? 
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isolated vertices to the graph G does not change the problem. Let us set 
L=n+m-—2(k+ E) > 0. We construct a graph G” as follows. 


(i) The vertex set of G’ contains V (G), which forms a clique in G’. 
(ii) For every edge e = uv of G, we introduce a vertex we in G” and make 
it adjacent to u and v. 
(iii) We introduce a clique K on £ new vertices as another component of G”. 


The output instance is (G’, k). 


The intuition behind the construction is that if we remove a set S of 
vertices from the clique formed by V(G) in G’, then it creates some 
number of components of size 1, corresponding to the edges induced 
by S in G. Therefore, if we want to maximize the number of these 
components appearing (to reduce the size of the other components), 
then we need a set S that maximizes the number of edges induced in 
G, that is, forms a k-clique. 





Note that G” has n+m+é = 2(n4+m—k (*)) > 2l vertices. Suppose that S 
is a k-clique of G. Then G’ \ S has the following components: (5) components 
of size 1, the clique K of size £ < |V(G’)|/2, and one component containing 
the remaining (n +m + £) — |S| (5) l=n+m—-k (4) = |V(G")|/2 
vertices, that is, every component of G’ — S has size at most |V(G")|/2. 

For the reverse direction, suppose that S is a set of at most k vertices 
in G” such that every component of G’ — S has size at most |V(G")|/2 = 
nt+tm—k— ($): First we show that it can be assumed that S C V(G). 
Clearly, it makes no sense to remove any vertex of the clique K of size @, 
as the size of this component is already at most |V(G’)|/2. Suppose that 
We E S for some edge e = uv. If u,v € S, then we many omit we from S: 
now We becomes an isolated vertex in G — (S \ {we}). If, say, u Z S, then 
(S \ {we}) U {u} is also a solution: omitting We from S increases the size of 
the component of u by 1 (note that u and v are adjacent, hence they cannot 
be in two different components of G’ — S), and introducing u into S decreases 
the size of this component by 1. Therefore, we may assume S C V(G). 

Consider the component C of G’ — S that contains the clique V(G) \ S (as 
we have assumed that n = |V(G)| > k, the component C is nonempty). If an 
edge e € E(G) has an endpoint not in S, then vertex we ¢ S of G” is also in 
the component C. Thus if we denote by p the number of edges induced by S 
in G, then component C has size (n — |S|) +m — p. Then the bound on the 
size of C implies (n — |S|) +m — p < |V(G)|/2=n+m—-—k—- Cs that is, 
we get |S] +p > k+ (E): As |S| < k, this is only possible if |S| = k and S 
induces a clique in G. 
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13.6.2 Reductions with vertex representation 


The most common technique for NP-hardness proofs is to represent parts of 
the original instance by gadgets. When reducing a graph problem to another 
graph problem, this is done by replacing each vertex and each edge of the 
original graph by a “gadget”: a small graph satisfying certain properties. Then 
it is argued that these gadgets have properties ensuring that the solutions of 
the constructed instance correspond to the solutions of the original instance. 
A typical example of a reduction of this type is reducing VERTEX COVER 
to HAMILTONIAN CYCLE. Similarly, when reducing from 3-SAT, one may 
use gadgets representing the variables and clauses of the original formula. A 
typical example is reducing 3-SAT to CLIQUE. 

Representing vertices and edges of the original graph (or variables and 
clauses of the original formula) by gadgets is usually not a suitable tech- 
nique for parameterized reductions. The problem is that, in most cases, by 
introducing a new gadget, one has to increase the parameter of the con- 
structed instance. Therefore, the parameter of the new instance would be 
proportional to the number of vertices /edges/variables/clauses of the origi- 
nal instance, which can be much larger than the parameter of the original 
instance, violating the second requirement of Definition [13.1] 


A typical parameterized reduction from CLIQUE uses gadgets in a very 
different way than NP-hardness proofs. Instead of creating a gadget 
for each vertex/edge of the input graph, we create k gadgets, one for 
each vertex of the solution. If we can ensure that the new parameter 
is proportional to the number of gadgets, then we satisfy the require- 
ment that the new parameter is bounded by a function of the original 
parameter. 


A consequence of using only k gadgets is that we need very different, much 
more powerful gadgets than in typical NP-hardness proofs. For example, in 
a polynomial-time reduction from, say, VERTEX COVER, the gadget needs to 
be able to represent only a constant number of different states: each vertex 
is either selected or not, and each edge can be covered in three possible ways 
(first endpoint, second endpoint, both endpoints). On the other hand, if we 
want a gadget to represent one of the vertices of the solution, then we need 
a gadget that is able to represent n different states, where n is the number 
of vertices of the original graph. 

Theorem the reduction from MULTICOLORED INDEPENDENT SET 
to DOMINATING SET, can be considered as an example of representing the 
vertices of the solution by gadgets. Selecting one vertex of the clique V; of 
G” corresponds to selecting a vertex of V; into the independent set. Thus 
we may view the clique V; as a gadget with |V;| possible states. Then the 
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vertices we “verify” that the states of the gadgets Vi, ..., Vz correspond to 
an independent set of G. 

The following reduction demonstrates the same principle: we want to rep- 
resent the k vertices of the solution by k gadgets and, for any two of these 
gadgets, we want to ensure that they represent nonadjacent vertices. This 
reduction also serves as an example for a reduction to a problem where the 
parameter is not related to the solution size, but it is a structural parame- 
ter measuring some property of the instance (in our case, the treewidth). In 
such a reduction, what we need to ensure is that the structural parameter is 
proportional to the number of gadgets. 

LIsT COLORING is a generalization of VERTEX COLORING: given a graph 
G, a set of colors C, and a list function L : V(G) — 2° (that is, a subset of 
colors L(v) for each vertex v), the task is to assign a color c(v) € L(v) to each 
vertex v € V(G) such that adjacent vertices receive different colors. VERTEX 
COLORING on a graph with treewidth w can be solved in time 20(¥ los w) .ņO (1) 
by standard dynamic-programming techniques (Theorem [7.10}. However, the 
straightforward application of dynamic programming yields only a nO()- 
time algorithm for the more general LIST COLORING problem (Exercise[7.20}. 
The following theorem shows that there is a reason for that: the problem is 
W[1]-hard parameterized by treewidth. 


Theorem 13.30. LIST COLORING is W[1]-hard when parameterized by the 
treewidth of the graph. 


Proof. We present a parameterized reduction from MULTICOLORED INDE- 
PENDENT SET. Let (G,k,(Vi,...,Ve)) be an instance of MULTICOLORED 
INDEPENDENT SET. The set C of colors corresponds to V(G). We construct 


a graph G’ in the following way (see Fig. |13.5). 


(i) For every 1 < i < k, we introduce a vertex u; with L(u;) = V;. 

(ii) For every 1 < i < j < k and for every edge ab with a € Vi, b € Vj, we 
introduce a vertex Wi jap with L(Wi jap) = {a,b} and make it adjacent 
to u; and uj. 


Observe that {u,,..., uz} is a vertex cover of G’, hence G” has treewidth at 
most k. Therefore, the reduction satisfies the requirement that the parameter 
of the constructed instance is bounded by a function of the parameter of the 
original instance. Suppose that G has an independent set of size k, consisting 
of vertices vı € Vi, ..., Uk © Vk. We can construct the following list coloring 
of G". First, we set the color of u; to v; € Vi. Then for every vertex Wi jab it 
is true that either u; has a color different from a, or uj has a color different 
from b (as a and b are adjacent, while v; and vj are not). Therefore, one of 
the two colors appearing in the list of Wi jab is not used on its neighbors; 
hence we can extend the coloring to Wi j a,b- 

For the reverse direction, suppose that c: V(G’) + V(G) is a list coloring 
of G’. We claim that c(u1) € L(u1) = Vi, ..., c(up) E€ L(uk) = Vp is an 
independent set in G. For a contradiction, suppose that a = c(u;) and b = 
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U2 











Fig. 13.5: The graph G” of the List COLORING instance constructed in the 
reduction of Theorem |13.30|for k = 5 


c(uj) are adjacent for some 1 <i < j < k. Then the vertex Wi ja p exists in 
G with list {a,b}. However, this vertex is adjacent to both u; and uj, hence 
C(Wi,j,a,b) can be neither a nor b, a contradiction. 














13.6.3 Reductions with vertex and edge representation 


In reductions from CLIQUE/INDEPENDENT SET based on vertex representa- 
tion, we need to enforce that gadgets represent adjacent /nonadjacent vertices. 
In Theorem [13.9] we enforced this condition by introducing the vertices we. 
Introducing these vertices had no effect on the parameter, as these vertices 
were supposed to be dominated by the k vertices selected from Vi, ..., Vp- 
Similarly, in Theorem we introduced the vertices w;,j,a,p to enforce 
that the coloring on w1, ..., wx represents an independent set. The intro- 
duction of these vertices did not increase the treewidth of the constructed 
instance. However, there are reductions where it seems difficult to enforce 
the adjacency /nonadjacency condition on two gadgets. We present two ex- 
amples of such problems and show how to prove W[1]-hardness in this case 
by representing both the edges and the vertices of the clique by gadgets. 


Example 1: ODD SET and variants 


In the ODD SET problem, where given a set system F over a universe U and 
an integer k, the task is to find a subset S C U of size at most k such that the 
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intersection of S with every set in F has odd size. A natural idea for reducing 
an instance (G,k,(Vi,...,V,)) of MULTICOLORED INDEPENDENT SET to an 
instance of ODD SET would be to let U be V(G), keep the same parameter 
k, and introduce the sets Vi,...,V,. The only way a set S of size at most 
k can have odd intersection with these k disjoint sets is to have exactly one 
vertex from each of Vi, ..., Vk. Now we would like to introduce sets into F 
to enforce that the vertex of the solution in V; is not adjacent to the vertex 
of the solution in V;. If a € V; and b € Vj are adjacent, then introducing the 
set {a,b} into F certainly prevents the possibility that both a and b is in a 
solution S. However, it also prevents the possibility that neither of a and 6 is 
in S, which may very well be the case for a solution. The reader may try other 
simple tricks, but it seems that the parity requirement is not strong enough 
to represent the complicated condition that two vertices are not adjacent. 

We get around this difficulty by a reduction from MULTICOLORED CLIQUE 
where we represent both the vertices and the edges of the clique, that is, we 
have k vertex gadgets and ($) edge gadgets. Then what we need to ensure is 
that if an edge gadget represents the edge uv, then the corresponding vertex 
gadgets represent u and v, respectively. As we shall see, such conditions are 
much easier to express and enforce. 


Theorem 13.31. ODD SET is W[1]-hard. 


Proof. We present a parameterized reduction from MULTICOLORED CLIQUE. 
Let (G, k, (V1, ..-, Vg)) be an instance of MULTICOLORED CLIQUE. Let E; j 
be the set of edges between V; and Vj, and let Fi j, be the subset of E; j 
consisting of the edges incident to v. We construct an instance (F, U, k’) of 
ODD SET as follows. 


(i) The universe is U := Uy Vi U Usejc jen Bij. 
(ii) Let k’ := k + ($). 
(iii) For every 1 < i < k, let F contain V;, and for every 1 < i < j < k, let 
F contain E; j- 
(iv) For every 1 < i < j < k, let F contain FE; y := Eijo U (Vi \ {v}) for 
every v € V; and Ef. := Ei jw U (V; \ {w}) for every w € Vj. 


Jw 


The intuitive idea of the reduction is the following. The solution needs 
to select one vertex from each of Vi, ..., Vk (describing k vertices of 
G) and one vertex from each F; j (describing E) edges of G). Then the 
sets introduced in (iv) ensure that the vertices selected from V; and V; 
are the endpoints of the edge represented by the vertex selected from 
E; j. More precisely, the set Ej j „ ensures that exactly one of V; \ {v} 
and Ei j contains a vertex of the solution, that is, v € V; is in the 
solution if and only if the solution contains a vertex of Ei jw- 
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Formally, let vı € Vi, ..., Uk E Ve be a k-clique in G and let e; j be the 
edge between v; and vj. We claim that the set S := UE {UU <iejcn {eis} 
has odd intersection with every set in F. This is clearly true for the sets V; 
and E; j. Consider now a set E; jẹ introduced in (iv). Suppose that v € V; 
(the argument is similar for the case v € V;). If the endpoint of e;,; in V; is 
v, then v = v; and e; j E€ Ei jv hold and hence SN E; jv = {e; j}, which has 
odd size. If the endpoint of e; ; in V; is not v, then v Æ v; and e; j ¢ Fi jw, 
hence SN E; ju = {vi}, which is again odd. 

For the reverse direction, let S C U be a set of size at most k’. As the 
k'=k+ C) sets introduced in (iii) are disjoint, each of them contains exactly 
one vertex of S. Let SOV; = {v;i} and SNE; j = {e:,;}. We claim that vı € Vi, 
..., Uk E Vp is a k-clique in G and e; j is the edge between v; and vj. Suppose 
that v; is not an endpoint of e;,;. Then v; ¢ Ej jw; and eij Z Eiju, hold 
and hence SN E; jv, = 0, a contradiction. The argument is similar if v; is 
not an endpoint of e; j. Therefore, e;,; is the edge between v; and vj; hence 


{v1,...,v%} indeed induces a clique in G. 














In general, the requirement that a € V; and b € V; are adjacent can be an 
arbitrarily complicated binary relation: the bipartite graph between V; and V; 
can be arbitrarily complicated. The key idea in the proof of Theorem [13.31]is 
that the requirement that v € V; is an endpoint of an edge e € E; j is a much 
simpler binary relation: it is a projection, that is, for every e € Fi j, there 
is only a single v € V; that satisfies the requirement. The general principle 
that we can learn from this example is that, after setting up the gadgets 
representing the k vertices and the (5) edges of the clique, all we need to 
ensure is that each state of an edge gadget forces a particular state for the 
the two vertex gadgets representing the endpoints of the edge. That is, we do 
not need to implement arbitrarily complicated relations between the gadgets, 
only a very specific simple relation requiring that a state of an edge gadget 
always implies a particular state on a vertex gadget. 

The proof of Theorem [13.31] allows us to prove W[1]-hardness for some 
variants of ODD SET. First, EXACT ODD SET is the variant that asks for a 
solution of size exactly k. Observe that, given a yes-instance of MULTICOL- 
ORED CLIQUE, the reduction of Theorem [13.31|produces an instance of ODD 
SET that has a solution of size exactly k’. Therefore, this is also a correct 
parameterized reduction from MULTICOLORED CLIQUE to EXACT ODD SET, 
proving the W{1]-hardness of the latter problem. 

One can define the EVEN SET and EXACT EVEN SET problems analo- 
gously: now each set has to contain an even number of vertices of the solu- 
tion. Note, however, that the empty set is always a correct solution for EVEN 
SET with this definition. Therefore, in the EVEN SET problem, we require 
the solution to be a nonempty set of at most k elements such that every set 
in the instance contains an even number of elements of the solution. It is not 
difficult to reduce EXACT Opp SET to EXACT EVEN SET (Exercise [13.19), 
showing that EXACT EVEN SET is also W[1]-hard. On the other hand, the 
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fixed-parameterized tractability of EVEN SET is a notorious open question. 
It is equivalent to other very interesting and deep questions: it can be also 
formulated as finding the minimum distance of a binary linear code or the 
minimum length cycle in a binary matroid. Therefore, settling this question 
may require a deeper combinatorial understanding and could be more chal- 
lenging than the simple reduction of Theorem 

UNIQUE HITTING SET is the variant of ODD SET where we require that 
each set contains exactly one element of the solution; EXACT UNIQUE HIT- 
TING SET is defined analogously. Looking at the reduction in Theorem [13.31] 
we can observe that if the MULTICOLORED CLIQUE instance is a yes-instance, 
then the constructed instance of ODD SET has the property that every set 
contains in fact only a single element of the solution. Therefore, the proof of 
Theorem [13.31] actually provides a reduction from MULTICOLORED CLIQUE 
to UNIQUE HITTING SET or EXACT UNIQUE HITTING SET. 


Theorem 13.32. The following problems are W(1]-hard: 


ODD SET 

EXACT ODD SET 

EXACT EVEN SET 

UNIQUE HITTING SET 
EXACT UNIQUE HITTING SET 


Interestingly, one can prove that UNIQUE HITTING SET and EXACT UNIQUE 
HITTING SET are actually W[1]-complete by a reduction to SHORT TURING 
MACHINE ACCEPTANCE (Exercise[13.20}, but this is open for the other three 
problems. 

It is natural to define analogues of ODD SET (and its variants) on graphs 
with the closed neighborhoods of the vertices playing the role of the set 
system, the same way as DOMINATING SET can be considered as a graph- 
theoretic analogue of HITTING SET. For example, the graph version of ODD 
SET asks for a set of at most k vertices such that N[v] contains an odd number 
of vertices of the solution for every vertex v. The graph version of EXACT 
UNIQUE HITTING SET is also known under the name PERFECT CODE. If the 
vertex set of a graph G is the set of all potential codewords and two vertices 
are adjacent if the corresponding codewords are “close,” then PERFECT CODE 
asks for a code containing exactly k words such that every word is close to 
exactly one word in the code. One can show that the graph versions of all five 


problems considered in Theorem |13.32|are also W[1]-hard (Exercise|13.21). 


Example 2: Steiner problems in directed graphs 


Given an undirected graph G, a set K C V(G) of terminals, and an integer 2, 
the STEINER TREE problem asks for a tree H with at most £ edges connecting 
all the terminals. In Sections and HOS ig gave algorithms for STEINER 
TREE running in time 3/k!lnO@ and 25n) respectively. One can define 
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similar problems on directed graphs. Given a set of terminals K C V(G) and 
a root r € V(G), DIRECTED STEINER TREE asks for a directed tree rooted 
at r such that every terminal in K is reachable from r on the tree. This 
problem generalizes STEINER TREE to directed graphs in the sense that we 
are looking for a subgraph that provides reachability from one distinguished 
vertex to every other terminal. See Exercise|6.7|for a 3/Kln°_-time algorithm 
for DIRECTED STEINER TREE. Another, equally natural, generalization is 
the STRONGLY CONNECTED STEINER SUBGRAPH problem, where the input 
is a directed graph G, a set K C V(G) of terminals, and an integer 4; the 
task is to find a strongly connected subgraph of G with at most £ vertices 
that contains every vertex of K. This problem generalizes STEINER TREE 
in the sense that we have to provide a path from any terminal to every 
other terminal. Note that one can define STRONGLY CONNECTED STEINER 
SUBGRAPH by requiring either at most / edges or at most £ vertices in the 
solution. The two versions are different, but here we discuss only the vertex 
version for simplicity. The parameter we consider is the number £ of vertices 
in the solution. Unlike DIRECTED STEINER TREE, this problem is W[1]-hard 
and the proof is another example of the edge representation strategy. 


Theorem 13.33. STRONGLY CONNECTED STEINER SUBGRAPH is W[1]- 
hard. 


Proof. We present a parameterized reduction from MULTICOLORED CLIQUE. 
Let (G, k, (V1, ..-, Vg)) be an instance of MULTICOLORED CLIQUE. Let E; j 
be the set of edges between V; and Vj. We construct an instance (G’, K, £) of 
STRONGLY CONNECTED STEINER SUBGRAPH as follows (see Fig. [13.6). 


(i) G’ contains all the vertices of G, a vertex x, vertices y;,; (1 <i < j < k), 
and a vertex we for every e € E(G). We define Ej}; = {we : e € Fiz} 
for 1<i<j<k. 

(ii) Let K := {z} U {yiz : 1<i< j< k} and let £ := k+1+2(8) = 
|K| +k + ($). 

(iii) For every 1 <i < k and v € Vj, we introduce the edges (x, v) and (v, x). 
(iv) For every 1 <i < j < k and e € F; j, we introduce the edges (y;,;, we) 
and (We, Yi,j)- 

(v) For every 1 < i < j < k and e € E;,,;, if a € V; and b € V; are the 
endpoints of e, then we introduce the edges (a, we) and (we, b). 


The intuitive idea of the reduction is the following. The solution can 
afford to select only one vertex from each of Vi, ..., Vk (“vertex gad- 
gets” describing k vertices of G) and one vertex from each E; ; (“edge 
gadgets” describing (o) edges of G). Each vertex we in E£;,; has only 
one inneighbor and one outneighbor different from y; j; thus if we is 
part of the solution, then those two vertices have to be selected as well. 
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Fig. 13.6: An overview of the reduction in the proof of Theorem The 
circled vertices are the terminals and the red edges show a possible solution. 
For clarity, we show only those edges of the graph between the V;’s and E; j 
that are in the solution 


Therefore, the state of each edge gadget forces a state on two vertex 
gadgets, implying that the k-vertex gadgets describe a k-clique in G. 


Formally, let vı € Vi, ..., Uk € Ve be a k-clique in G and let e;,; be the edge 
between v; and vj. We claim that the set S:= KU {v; : 1<i<k}U{eij : 
1 <i < j < k} induces a strongly connected subgraph G’[S]; note that 
|S| = £. For every 1 < i < j < k, the directed closed walk xujwe, „Yi jWe; VjE 
uses only vertices of S. Moreover, these (5) directed closed walks cover every 
vertex of S, hence G’[S] is strongly connected. 

For the reverse direction, let S be a set of size at most £ such that K C 
S C V(G') and G’[S] is strongly connected. Clearly, such a set has to include 
at least one outneighbor of each vertex in K, which means that S contains 
at least one vertex of E; ; for every 1 < i < j < k. All the inneighbors of 
E; j U {yi j} are in V;, while all the outneighbors are in V;, hence each of V; 
and V; has to contain at least one vertex of the solution. Taking into account 
the quota of = |K|+k+ es) on the size of the solution, this is only possible 
if S contains exactly one vertex v; € V; for every 1 < i < k and exactly one 
vertex We, , € E; j for every 1 < i < j < k. We claim that vı € Vi, ..., 
Uk E Vp is a k-clique in G and e; ; is the edge between v; and vj. Suppose 
that v; is not an endpoint of e; j. Then v; is not an inneighbor of we, ,. The 
only other inneighbor of we, , is yi,j and the only inneighbor of y;,; in S is 
We, ; itself. Therefore, the strongly connected component of G'[S] containing 
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We; ; consists of only yj; and we, ,, a contradiction. The argument. is similar 


if v; is not an endpoint of e; j. Therefore, e; j is the edge between v; and vj, 
hence {v1,..., Uk} indeed induces a clique in G. 














Note that the reduction in Theorem[13.33]creates a directed graph containing 
bidirected edges, that is, there are pairs of vertices that are connected by 
directed edges in both directions. Exercise [13.26] asks for a reduction that 
does not create bidirected edges. 


Exercises 


13.1 (@). Is there a parameterized reduction from VERTEX COVER to INDEPENDENT SET? 
13.2 (Z). Give a parameterized reduction from SET Cover to DOMINATING SET. 


13.3 (Z). Inthe MULTICOLORED BICLIQUE problem the input consists of a bipartite graph 
G with bipartition classes A, B, an integer k, a partition of A into k sets A1, Ao,..., Ak, 
and a partition of B into k sets Bi, Bo,..., Br; the question is whether there exists a 
subgraph of G isomorphic to the biclique K;,,,, with one vertex in each of the sets A; and 
Bi. Prove that MULTICOLORED BIcLIQque is W[1]-hard. 


13.4. Prove Theorem [13.12 


13.5. In this exercise we will work out an alternative, purely combinatorial construction 
of k-paradoxical tournaments of size 2P°ly(*), 


e Find a 2-paradoxical tournament T* on seven vertices. 

Assume we are given some tournament T. Construct a tournament T’ as follows. The 
vertices of T” are triples of vertices of T, i.e., V(T’) = V(T) x V(T) x V(T). Let us 
consider a pair of triples u1 = (a1, b1,c1) and u2 = (a2, b2,c2), where a1 Æ a2, bı Æ b2, 
and cı Æ c2. Consider now pairs {a1, a2}, {b1, b2}, {c1,c2}, and count for how many 
of them the edge in T was directed from the vertex with subscript 1 to the vertex of 
subscript 2 (e.g., from a to a2). If for at least two pairs this was the case, we put 
(ui,u2) E€ E(T’), and otherwise we put (u2,u1) € E(T'). For all the pairs of triples 
where at least one of the coordinates is the same, we put the edge between the triples 
arbitrarily. Prove that if T was k-paradoxical, then T” is | 4 |-paradoxical. 

e Define the sequence To, Ti, T2,... as follows: To = T*, and for m > 1 the tournament 
Tm is constructed from Tm —1 using the construction from the previous point. Prove 
that |V(Tm)| = 73” and that Tm is g(m)-paradoxical for a function g(m) € 2((3/2)™). 

e Using the previous point, provide an algorithm that, given an integer k, constructs a 
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k-paradoxical tournament of size 20(* ) < 20k") The construction should 
work in time polynomial with respect to the size of the constructed tournament. 


13.6. Let Cs 4,q be the class of circuits having depth at most d where every path from an 
input node to the output node contains at most t nodes with indegree larger than s. (Thus 
the class C+ q defined in Section [13.3]is C2,t,4-) Show that, for every t,d > 1, s > 2, there is 
a parameterized reduction from WCS][Cs,t,a] to WCS[C;,as]. Conclude that replacing Ct, a 
in Definition with Cs 4,4 for some fixed s > 2 would not change the definition of W[é]. 


13.7. Consider the WEIGHTED CIRCUIT SATISFIABILITY problem, restricted to monotone 
(i.e., without negations) circuits from the class C;,q for some fixed constant d. Prove that 
this problem is FPT when parameterized by k, the weight of the assignment in question. 
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13.8. Give a parameterized reduction from INDEPENDENT SET to SHORT TURING MACHINE 
ACCEPTANCE. 


13.9. Consider a restricted variant of the SHORT TURING MACHINE ACCEPTANCE problem, 
where we consider only instances (M, x, k) with x being an empty string. Reduce the general 
SHORT TURING MACHINE ACCEPTANCE problem to the restricted one. 


13.10. Work out the details of the parameterized reduction from PARTIAL VERTEX COVER 
to SHORT TURING MACHINE ACCEPTANCE (Theorem|}13.24). 


13.11. Prove that DOMINATING SET remains W/[2]-hard even if restricted to graphs ex- 
cluding the star Ky,4 as an induced subgraph. 


13.12. Give a parameterized reduction from DOMINATING SET to DOMINATING SET WITH 
PATTERN. 


13.13 ($). Modify the reduction in the proof of Theorem|13.26}to obtain a parameterized 
many-one reduction from CONNECTED DOMINATING SET to DOMINATING SET. 


13.14. Given a graph G and an integer k, the INDUCED MATCHING problem asks for an 
induced matching of size k, that is, k edges x1y1, ..., £kYk Such that the 2k endpoints are 
all distinct and there is no edge between {x;,y;} and {zj, yj} for any i # j. Prove that 
INDUCED Matcuina is W[{1]-complete. 


13.15. Given a graph G and an integer k, the INDEPENDENT DOMINATING SET problem 
asks for a set of exactly k vertices that is both an independent set and a dominating set. 
Prove that INDEPENDENT DOMINATING SET is W[2]-complete. 


13.16. In the Lona INDUCED PATH problem the input consists of a graph G and an integer 
k, and the question is whether G contains the path on k vertices as an induced subgraph. 
Prove that this problem is W[1]-hard when parameterized by k. 


13.17. Consider the following variant of the STEINER TREE problem: We are given a graph 
G, a set of terminals K C V(G), and an integer parameter k. The question is whether one 
can find a set X C V(G)\ K of cardinality at most k such that G[K U X] is connected. In 
other words, we parameterize the STEINER TREE by the number of nonterminals that can 
be added to the tree, while the number of terminals can be unbounded. Prove that this 
parameterization of STEINER TREE is W[2]-hard. 


13.18 (Æ). Is List Cotorina W[l]-hard parameterized by the vertex cover number? 
13.19. Give a parameterized reduction from Exact Opp SET to Exact Even SET. 


13.20 (8). Show that Unique HITTING Set and Exact Unique HITTING SET are in 
w{i]. 


13.21. Show that the graph versions of all five problems in Theorem[|13.32ļare W[1]-hard. 
(The graph version of Exact UNIQUE HITTING SET is called PERFECT CODE.) 


13.22. In the SUBSET Sum problem the input consists of a sequence of n integers 
a1,a2,...,an and two integers s,k, and the question is whether one can find a set 
I C {1,2,...,n} of exactly k indices such that Mier a; = s. Prove that SUBSET SUM 
is W[1]-hard when parameterized by k. 


13.23. In the SET PACKING problem the input consists of family F of subsets of a finite 
universe U and an integer k, and the question is whether one can find k pairwise disjoint 
sets in F. Prove that Ser Pacxina is W[1]-hard when parameterized by k. 
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13.24. In the Exact CNF-SAT problem the input consists of formula y in the conjunctive 
normal form and an integer k, and the question is whether one can find an assignment w 
of weight exactly k such that in each clause of y exactly one literal is satisfied. Prove that 
Exact CNF-SAT is W[1]-hard when parameterized by k. 


13.25. In the PERMUTATION COMPOSITION problem the input consists of a family P of 
permutations of a finite universe U, additional permutation m of U, and an integer k, 
and the question is whether one can find a sequence 71,72,...,7% E P such that 7 = 
7™10720...07%. Prove that PERMUTATION COMPOSITION is W[1]-hard when parameterized 
by k. 


13.26 (&). Modify the reduction of Theorem|13.33]in such a way that the constructed 
graph has no bidirected edges. 


13.27. In the VERTEX k-Way Cut problem the input consists of a graph G and two 
integers k,s, and the question is whether one can find a set X of at most k vertices such 
that the graph G — X has at least s connected components. Prove that VERTEX k-Way 
Cut is W[1]-hard when parameterized by k + s. 


13.28. Given a bipartite graph G with bipartite classes A,B C V(G) and an integer k, 
the HALL SET problem asks for a Hall set of size at most k, that is, a set S C A of size at 
most k such that |N(S)| < |S|. Show that HarL Set is W[1]-hard. 


13.29 ($). Recall that a graph is 2-degenerate if its every subgraph contains a vertex of 
degree at most 2. In the 2-DEGENERATE VERTEX DELETION problem one is given a graph 
G and an integer k, and the question is whether there exists a set X of at most k vertices 
such that the graph G — X is 2-degenerate. Prove that 2-DEGENERATE VERTEX DELETION 
is W[2]-hard when parameterized by k (if you find it more convenient, you may prove just 
W[1]-hardness). 


13.30 ($). In the LONGEST COMMON SUBSEQUENCE problem the input consists of a 
finite alphabet X, a set of strings s1,s2,...,s¢ € X* and an integer k, and the question is 
whether there exists a string s € X* of length k such that s is a subsequence of s; for every 
1 < i < s. Prove that LONGEST COMMON SUBSEQUENCE is W[1]-hard when parameterized 
by k. 


Hints 


The following algorithm satisfies the formal definition of a parameterized reduction: 
solve the VERTEX COVER instance in FPT time and output a trivial yes-instance or no- 
instance of INDEPENDENT SET. More generally, if A is FPT, then A has a parameterized 
reduction to any parameterized problem B that is nondegenerate in the sense that it has 
at least one yes-instance and at least one no-instance. 


Sample the tournament by choosing the orientation of each edge independently and 
uniformly at random. Prove that for n > r(k) the sampled tournament is k-paradoxical 
with positive probability. 


13.6| Every OR-node or AND-node with indegree at most s can be simulated with a chain 
of at most s — 1 nodes with indegree 2. This way, we can construct an equivalent circuit 
such that depth increases only by a factor of at most s. In fact, by replacing a node with 
indegree s with an (almost) complete binary tree of indegree-2 nodes, we can restrict the 
increase of depth to O(log s). 
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Observe that there is only a constant number of large gates. Guess the output of all 
large gates. For each large OR gate, guess which in-edge of this gate is satisfied, and what 
the assignment is of the input gates that make this particular in-edge satisfied (it depends 
only on a constant number of input gates). Finally, to resolve large AND-gates, design a 
simple branching algorithm. 

In the first k steps, the Turing machine guesses k vertices and writes them on the 
tape. In the next (5) phases, the Turing machine checks whether every pair of the guessed 
vertices is distinct and nonadjacent. Each phase can be implemented in O(k) steps. 


Modify the Turing machine in such a way that it writes x on the tape in the first || 
steps (and then the head returns to the starting position). 


13.11| Try to adjust the reduction for CONNECTED DOMINATING SET of Theorem]13.15 
so that the output graph has no Kj,4 as an induced subgraph. You would probably want 
to turn as many big parts of the graph into cliques as possible. 


13.12) The reduction of Theorem|13.15]from DOMINATING SET to CONNECTED DOMINAT- 
ING SET has the property that if the created instance has a connected dominating set of 
size k, then it has a dominating set of size k inducing a clique. 


Express CONNECTED DOMINATING SET as 2°(*") instances of DOMINATING SET 
WITH PATTERN and reduce them to 20(**) instances of DOMINATING SET (or SET Cover if 
you will). Then use the composition algorithm of Section [15.2.3] to create a single instance 
of SET Cover. 


Reduce from MULTICOLORED INDEPENDENT SET. Given a MULTICOLORED INDE- 
PENDENT SET instance (G,k,(Vi, V2,..., Vk)), aim at an instance (G’,k’) of Lone In- 
DUCED PaTH with k’ = 3k. In the intended solution, the (3i — 1)-th vertex of the path 
belongs to Vi, whereas the remaining vertices belong to some gadgets you introduce. 


Reduce from SET COVER. 


Note that treewidth of a graph is at most its vertex cover number. Therefore, the 
statement of Theorem [13.30] (W[1]-hardness parmeterized by treewidth) does not imply 
W[1]-hardness parameterized by vertex cover number, as vertex cover can be a much larger 
parameter. However, by looking at the proof of Theorem [13.30] we can observe that the 
reduction creates instances of List COLORING with vertex cover number at most k, which 
means that the same proof proves also the (stronger) result that the problem is W[1]-hard 
parameterized by the vertex cover number. 


By introducing a new element x and a new set {x} on it, we may assume that 
the parameter k of the Exact Opp SET instance is even. To create an instance of Exact 
Even SET, introduce a new element y, add it to every set, let us introduce a new set 
U \ {y}, and set k’ := k + 1 (which is an odd number). Now the set U \ {y} ensures that 
every solution of size exactly k’ contains y. Consequently, if A is a set of the Exact Opp 
SET instance, then the fact that the solution of the Exact Even SET instance has even 
intersection with AU {y} implies that it has odd intersection with A. 


We can prove membership by a reduction to SHORT TURING MACHINE ACCEP- 
TANCE. The Turing machine first guesses the (at most k) elements of the solution. Then 
by testing all the ($) pairs, it verifies that there is no set in the instance that contains 
two elements of the solution. Finally, to check that all the sets are hit by the solution, the 
Turing machine sums the total degree of the elements (that is, the number of sets they are 
contained in) of the solution. As no set is hit by more than one element of the solution, 
this sum is exactly the number of sets hit by the solution. To implement this algorithm in 
a Turing machine, we have to hard-code which pairs of elements are contained in some set 
together and the degree of each element. 
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13.22] Reduce from PERFECT Cope or Exact UNIQUE HITTING SET. 


13.23] Reduce from INDEPENDENT SET: for each vertex v of the input instance (G, k), 
create a set F, consisting of all edges of G incident to v. 


13.24] Reduce from PERFECT Cope or Exact UNIQUE HITTING SET. 
13.25] Reduce from PERFECT Cope or Exact UNIQUE HITTING SET. 


13.26] Split each nonterminal vertex v into two vertices vin and vout connected by an edge 
(Vin, Vout). Modify k’ appropriately. 


13.27] Try a reduction similar to the one given in the proof of Theorem}13.29 


Reduction from CLIQUE. Given a graph G, we construct a bipartite graph where 
class A corresponds to the edges of G and class B corresponds to the vertices of B; the 
vertex of A corresponding to edge uv of G is adjacent to the two vertices u,v € B. 
Additionally, we introduce a set of (5) — k — 1 vertices into B and make them adjacent to 
every vertex of A. Show that every Hall set of size at most (4) has size exactly (5) and 
corresponds to the edges of a k-clique in G. 


You may find the following characterization of d-degenerate graphs useful: a graph 
is d-degenerate if one can delete all the vertices of the graph one by one, each time removing 
a vertex that at the current moment has degree at most d. 

Start the reduction from SET Cover. Create k set-choice gadgets that emulate choices 
of k sets of the solution. Each gadget should be 3-regular, and should become 2-degenerate 
after removing any of its vertices. Then create a gadget for each element of the universe, 
and wire these gadgets with the set-choice gadgets to encode the input instance. Ensure 
the following properties: (i) If an element is not covered, then the closed neighborhood of 
the corresponding gadget induces a 3-regular graph untouched by the solution. (ii) If an 
element is covered, then the closed neighborhood of the corresponding gadget is affected 
by the solution and it is possible to “rip” the whole gadget in the process described in the 
previous paragraph. 


Reduce from MULTICOLORED CLIQUE. Let (G, k, (Vi, V2,..., Vk)) be a MULTICOL- 
ORED CLIQUE instance. The string s should encode the choice of vertices and edges of the 
clique, that is, V(G) U E(G) C X (you may need some additional special symbols in X 
that will serve as separators or markers). To this end, fix an arbitrary total order < on 
each set V;. Let v be the string consisting of all vertices of V; in the increasing order of 
<, and vý be the string A reversed. The orders on V; and Vj impose a lexicographical 
order on each set F; j of edges connecting V; and Vj in the graph G, and hence the strings 
El, and E}, are defined in a natural manner. Consider two strings 


sı = VIVi ... V} E] E] 3... Ef_1 p and 
— pry tpt pt 4 
s2 = VÝ VZ ... Vp EY 2 Fi 3--- Ekai, 
Observe that the longest common subsequence of sı and sz is of length k+ Ej and contains 
exactly one vertex v;i from each set V; and exactly one edge e;,; from each set Fj,;. 


Now, design a “gadget” consisting of two strings that ensures that v; is an endpoint of 
ei j, for each 1 <i < j < n. Similarly, verify the second endpoint of e;,;. 
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Bibliographic notes 


Downey and Fellows defined the notion of parameterized reductions and 
recognized that they can be used to transfer fixed-parameter intractability results. The 
W[1]-hardness of CLIQUE and INDEPENDENT SET on regular graphs, as well as the fact 
that this can be used to show that PARTIAL VERTEX Cover is W[1]-hard, was observed 
by Cai [64] and Marx [350]. The introduction of MuLTICOLORED Cuique and its W[1]- 
hardness is usually attributed to Fellows, Hermelin, Rosamond, and Vialette [179], but 
earlier Pietrzak studied the same problem under the name PARTITIONED CLIQUE. 

The W[2]-completeness of DOMINATING SET ON TOURNAMENTS was proved by Downey 
and Fellows [152]; however, the issue of how to construct efficiently a k-paradoxical tour- 
nament was glossed over. The probabilistic proof of the existence of small k-paradoxical 
tournaments, i.e., Theorem [13.12] was given by Erdős [166]. The best known determin- 
istic construction, given by Graham and Spencer [233], achieves size O(4* - k?) for a k- 
paradoxical tournament and is based on number-theoretical concepts. The combinatorial 
construction of Exercise [13.5] has been proposed by Tyszkiewicz [422]. The vast majority 
of parameterized reductions in the literature are actually polynomial-time reductions. The 
reduction to DOMINATING SET ON TOURNAMENTS presented in Theorem [13.14] is one of 
the rare exceptions: the running time depends superpolynomially on k. Other examples 
of such reductions can be found in the W/[1]-hardness proofs of the Vapnik-Chervonenkis 
(VC) dimension [152] and certain parameterizations of CLOSEST SUBSTRING [349] (see 
Theorem in Section [14.4}. 

The W-hierarchy was defined by Downey and Fellows {157}. 
These series of papers prove, among other results, the W[1]-completeness of INDEPENDENT 
SET eae and the characterization of W[t] by normalized circuit satisfiability 
(Theorem|13.19). The connection of W[1] and Turing machines was investigated by Cai, 
Chen, Downey, and Fellows [67]. They show W/[1]-hardness of SHorT TURING MACHINE 
ACCEPTANCE by a reduction from CLIQUE and show membership in W[1] by reducing 
SHORT TURING MACHINE ACCEPTANCE to WEIGHTED CIRCUIT SATISFIABILITY with weft- 
1 circuits. Together with the W[1]-hardness of INDEPENDENT SET (Theorem 13-18), this 
gives a reduction from SHORT TURING MACHINE ACCEPTANCE to INDEPENDENT SET. In 
Section [13-22] we worked out a direct reduction from SHORT TURING MACHINE AccEp- 
TANCE to INDEPENDENT SET to provide self-contained evidence (without the need for the 
W-hierarchy and Theorem[13.18) why INDEPENDENT SET is unlikely to be fixed-parameter 
tractable. 

Cesati observed that reduction to SHORT TURING MACHINE ACCEPTANCE can 
be a convenient way of proving membership in W[1]. We used this technique for PARTIAL 
VERTEX CovER (Theorem|13.24) and for Unique HITTING SET Ce 

The W[1]-hardness proof of BALANCED VERTEX SEPARATOR is by Marx [347]. The W[1]- 
hardness proof of List COLORING parameterized by treewidth is by Fellows, Fomin, Lok- 
shtanov, Rosamond, Saurabh, Szeider, and Thomassen [177]. In Section [14.29] we will see 
a different way of proving W[1]-hardness for List COLORING parameterized by treewidth, 
which also works for planar graphs. 

Downey and Fellows proved the W[1]-hardness of PERFECT Cope. Downey, Fel- 
lows, Vardy, and Whittle used the W[1]-hardness of PERFECT Cope to show (via com- 
plicated reductions) the W/[1]-hardness of variants of Opp Set (and of further problems 
that we do not discuss here). In Section we presented a very simple and self-contained 
W[1]-hardness proof for Opp SET that can be adapted for other variants. 

The W(2]-hardness of DOMINATING SET in graphs excluding K1,4 as an induced sub- 
graph has been proved independently by Cygan, Philip, Pilipczuk, Pilipczuk, and Woj- 
taszczyk and Hermelin, Mnich, van Leeuwen, and Woeginger [259]. It is worth men- 
tioning that, as proved in both of these papers, DOMINATING SET becomes FPT if restricted 
to graphs excluding K,,3 as an induced subgraph (i.e., in claw-free graphs). 
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The W[1]-hardness of various edge and vertex versions of STRONGLY CONNECTED 
STEINER SUBGRAPH was shown by Guo, Niedermeier, and Suchy and by Chitnis, 
Hajiaghayi, and Marx [92]. In Section [13.6] we have presented a somewhat simpler proof 
that works only for the vertex variant. 

Vertex deletion problems to d-degenerate graphs (Exercise[13.29) were studied by Math- 
ieson [360]. It appears that they are even harder than as stated in Exercise [13.29] 

Given a graph G and an integer k, the BICLIQUE problem asks for a Kk k subgraph, 
that is, two disjoint sets X,Y C V(G) of vertices of size exactly k such that every vertex 
of X is adjacent to every vertex of Y. The fixed-parameter tractability of BICLIQUE was a 
longstanding open question. Very recently, Lin proved that BicLiquE is W[1]-hard, 
resolving this question. (Exercise [13.3] considers the multicolored version of the problem, 
which can be shown to be W[1]-hard in a much easier way.) 


Chapter 14 


Lower bounds based on the 
Exponential-Time Hypothesis 


The Exponential Time Hypothesis (ETH) is a conjec- 
ture stating that, roughly speaking, n-variable 3-SAT 
cannot be solved in time 2°™. In this chapter, we 
prove lower bounds based on ETH for the time needed 
to solve various problems. In many cases, these lower 
bounds match (up to small factors) the running time 
of the best known algorithms for the problem. 





In the previous chapter, we have learned tools for distinguishing param- 
eterized problems that do admit fixed-parameter tractable algorithms from 
those that probably do not. However, as we have seen before, FPT algo- 
rithms come with a full variety of running times. On one hand, we have 
very efficient subexponential parameterized algorithms following from bidi- 
mensionality (see Section (7.7), with a typical running time of the form 
O* (20 (VF-polylog(k)) fi] On the other hand we have algorithms derived from 
Courcelle’s theorem, where the dependency on the treewidth of the graph is 
not even elementary. Somewhere in between lie “standard” fixed-parameter 
tractable problems amenable to other techniques, like branching, color cod- 
ing, iterative compression, kernelization, algebraic tools, representative sets, 
etc. For these problems the typical running time of an FPT algorithm is 
O*(200)), Ox (QO losk)) CE*(QPely(*)) or doubly exponential in k. There- 
fore, it seems that the class FPT has to contain an inner hierarchy of classes, 
corresponding to respective forms of parameter dependency of the running 
times of FPT algorithms. In order to uncover and describe this inner hier- 





1 Tn this chapter we focus on lower bounds on the parametric dependence of the running 
times of parameterized algorithms. For this reason, we extensively use the O*-notation that 
suppresses factors polynomial in the input size; recall that we defined it in Section [L-I] 
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archy, we need to develop a methodology for proving lower bounds: Given a 
particular problem, we need to establish an asymptotic lower bound on func- 
tion f that can appear in the running time of any FPT algorithm solving the 
problem. The assumption that FPT # W[1] seems too weak to achieve this 
goal, and therefore we will introduce stronger complexity assumptions. 

The Exponential Time Hypothesis (ETH) is a conjecture stating that, 
roughly speaking, 3-SAT has no algorithm subexponential in the number of 
variables. As we will see this conjecture implies that FPT #4 W[1], hence it 
can be also used to give conditional evidence that certain problems are not 
fixed-parameter tractable. More importantly, ETH allows us to prove quanti- 
tative results of various forms. For example, we can prove results saying that 
(assuming ETH) a problem cannot be solved in time 2°), or a parameterized 
problem cannot be solved in time f(k)n°™), or a fixed-parameter tractable 
problem does not admit a 2°*)nO™-time algorithm P| In many cases, the 
lower bounds obtained this way match (up to small factors) the best known 
algorithm, giving a tight understanding of the complexity of the problem. 
In recent years, the field of parameterized complexity has been completely 
revolutionized by the fact that such tight lower bounds can be proved. Ob- 
taining tight lower bounds seems to be a reachable goal for many problems 
and working towards this goal opens up new algorithmic questions to explore. 
A variant of ETH, called the Strong Exponential Time Hypothesis (SETH) 
can be used to give even more refined lower bounds saying, e.g., that a pa- 
rameterized problem cannot be solved in time (2 — e) nf ® for any e > 1. 

In Section [14.1] we introduce ETH and SETH, and derive classic conse- 
quences of the former. In Section we show how ETH can be used to 
establish lower bounds on the running times of FPT algorithms. Section [14.4] 
is devoted to the study of problems that are W[1]-hard and hence probably 
not FPT; it appears that for such problems ETH can be used to provide 
sharp estimates on the asymptotic behavior of the function of the parameter 
appearing in the exponent of n. Finally, in Section [F14.5] we present selected 
lower bounds based on SETH. 


14.1 The Exponential-Time Hypothesis: motivation and 
basic results 


The starting point of our considerations will be the hardness of the CNF-SAT 
problem. Recall that in the CNF-SAT problem we are given a propositional 
formula y on n Boolean variables x1, 22,...,2p, that is in conjunctive normal 
form (CNF). This means that p = Cy A C2 A... ^ Cm, where the Cis, called 
clauses, are of the form C; = (,V05V...VE;.,. Here, the £s are literals, that is, 








2 Recall that we say that f(n) € o(n) if f(n) < an for some unbounded and nondecreasing 


function s. 
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appearances of some variable in a negated or non-negated form. The question 
is whether there exists an assignment of true/false values to the variables so 
that p becomes true. By restricting the number of variables appearing in 
each clause to some constant q we arrive at the g-SAT problem. 

Since for q > 3 the g-SAT is NP-complete, we do not expect it to be solv- 
able in polynomial time. Our current knowledge, however, is very far from this 
unattainable goal. Obviously, the general CNF-SAT problem can be solved 
in time O*(2”) by trying all possible true/false assignments. Apparently, we 
do not know any algorithm that is substantially faster than this brute-force 
solution. Some improvement is possible, however, when the length of clauses 
is restricted: for every q > 3, there exists y, = 1 — oci) and an algorithm 
resolving g-SAT in time O*(2%"). For the most studied case of q = 3, the 
current champion achieves y3 < 0.387. 


The current status of research on satisfiability problems suggests that 
the following two natural barriers are hard to break: 


1. Obtaining a subexponential algorithm for 3-SAT, i.e., one with run- 
ning time 2°00., 

2. Finding an algorithm for the general CNF-SAT problem with run- 
ning time O*((2 — £)” ) for some constant € > 0. 


These two barriers motivate the complexity assumptions that we will in- 
troduce in a moment. For q > 3, let 5, be the infinimum of the set of constants 
c for which there exists an algorithm solving q-SAT in time O* (2°). In this 
definition, we allow only deterministic algorithms; we will discuss random- 
ized analogues in what follows. The Exponential-Time Hypothesis and Strong 
Exponential-Time Hypothesis are then defined as follows. 


Conjecture 14.1 (Exponential-Time Hypothesis, ETH). 


63 > 0 


Conjecture 14.2 (Strong Exponential-Time Hypothesis, SETH). 


lim 6, = 1 
q> 
Intuitively, ETH states that any algorithm for 3-SAT needs to search 

through an exponential number of alternatives, while SETH states that as 
q grows to infinity, brute-force check of all possible assignments becomes 
more and more inevitable. As we will see later, ETH actually implies that 
FPT 4 W[1], which supports the intuition that we are introducing stronger 
assumptions in order to obtain sharper estimates of the complexity of param- 
eterized problems. 


470 14 Lower bounds based on the Exponential-Time Hypothesis 


Looking back at the motivating barriers, note that ETH implies that 3- 
SAT cannot be solved in time 2°”), while SETH implies that CNF-SAT 
cannot be solved in time O*((2 — €)”) for any € > 0. However, the converse 
implications are unclear. It appears that the (possibly slightly stronger) state- 
ments of ETH and SETH, as stated in Conjectures [14-1]and are much 
more robust when it comes to inferring their corollaries via reductions. As 
we will see later, SETH actually implies ETH; see Theorem [14.5] 

Let us briefly discuss the possibility of defining the randomized versions of 
ETH and SETH. In Chapter [5] we have already seen a notion of one-sided er- 
ror in the case of Monte Carlo algorithms. One can also consider randomized 
algorithms with two-sided error. Such an algorithm both for any yes-instance 
and for any no-instance returns an incorrect answer with probability bounded 
by a constant that is strictly smaller than 1/2. Here, we modify the defini- 
tion of 6, by allowing also randomized two-sided error algorithms. Let 67 < ôq 
be the modified constant for gq-SAT. Then randomized ETH and randomized 
SETH state that 63 > 0 and limg,.. 67 = 1, respectively. Clearly, randomized 
ETH and SETH imply their deterministic analogues. In principle, whenever 
the existence of a deterministic algorithm with some running time can be ex- 
cluded under deterministic ETH, the same chain of reductions usually shows 
that a randomized algorithm with the same running time is excluded under 
randomized ETH. 

Allowing randomization in the definitions of ETH and SETH seems rea- 
sonable from the point of view of the research on satisfiability: the vast ma- 
jority of fast algorithms for q-SAT are inherently randomized, including the 
currently fastest O(2°-3°°") algorithm for 3-SAT. On the other hand, the as- 
sumption of allowing randomization is inconvenient from a pragmatic point 
of view: dealing with the error probability in multiple reductions that we 
present in this chapter would lead to more technical and obfuscated proofs. 
Therefore, we decided to use the deterministic versions of ETH and SETH 
as our main assumptions, and the reader is asked to verify the steps where 
randomization creates technical difficulties in Exercises [4.1]and [14.4] Hence- 
forth, whenever we are referring to an algorithm, we mean a deterministic 
algorithm. 

Before we proceed, let us remark that while ETH is generally considered 
a plausible complexity assumption, SETH is regarded by many as a quite 
doubtful working hypothesis that can be refuted any time. For this reason, 
lower bounds proved under the assumption of SETH should not be regarded 
as supported by very strong arguments, but rather that the existence of 
better algorithms would constitute a major breakthrough in the complexity 
of satisfiability. 

Most reductions from, say, 3-SAT, output instances of the target problem 
whose sizes depend on the actual size of the input formula, rather than on the 
cardinality of its variable set. However, an arbitrary instance of 3-SAT can 
have up to O(n?) distinct clauses; as we will see in the next chapter, this size 
cannot be nontrivially shrunk in polynomial time under plausible complexity 
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assumptions (see Theorem|15.29). Therefore, a reduction that starts from an 
arbitrary instance of 3-SAT and proceeds directly to constructing the output 
instance, would inevitably construct an instance of size at least cubic in the 
number of input variables. 

Suppose now that we are trying to prove a lower bound on the running time 
of algorithms solving the target problem. Then given such a reduction with 
a cubic blow-up, the best we could hope for is excluding an algorithm for the 
target problem that on any input instance x achieves running time gollel"), 
here, || is the size of instance x. Indeed, composing the reduction with such 
an algorithm would yield a 2°”) algorithm for 3-SAT, contradicting ETH. 
However, if we could somehow assume that the number of clauses of the input 
instance was linear in terms of n, then the reduction could give even an 2°(!#1) 
lower bound, providing that the output size is linear in the size of the input 
formula. 

Therefore, it seems natural to try to sparsify the input instance of 3-SAT 
(or more generally g-SAT), in order to be able to assume that the number 
of clauses is comparable to the number of variables. This is exactly the idea 
behind the milestone result of this theory, called the sparsification lemma. 


Theorem 14.3 (Sparsification lemma, [273]). For alle > 0 and positive 
q, there is a constant K = K(e,q) such that any q-CNF formula vy with n 
variables can be expressed as p = Via pi, where t < 2°" and each yh; is a 
g-CNF formula with the same variable set as p and at most Kn clauses. 
Moreover, this disjunction can be computed by an algorithm running in time 


Since the proof of the sparsification lemma is not necessary to understand 
the remainder of this chapter, we omit it in this book and refer the reader to 
original work for an exposition. 

Before we proceed, let us elaborate on what this result actually says. The 
sparsification lemma provides a reduction that sparsifies the given instance 
of g-SAT so that its total size is comparable to the size of its variable set. In 
order to circumvent the aforementioned incompressibility issues for q-SAT, 
we allow the reduction to use exponential time and output a logical OR of 
exponentially many instances (i.e., it is a Turing reduction, see Section[*13.5}, 
yet these exponential functions can be as small as we like. Thus, whenever 
reducing from g-SAT for a constant q, we can essentially assume that the 
number of clauses is bounded linearly in the number of variables. 

The following immediate consequence of the sparsification lemma will be 
our main tool for obtaining further corollaries. 


Theorem 14.4. Unless ETH fails, there exists a constant c > 0 such that no 
algorithm for 3-SAT can achieve running time O* (20+m)), In particular, 
3-SAT cannot be solved in time 200) 


Proof. Assume that for every c > 0 there exists an algorithm Ae that solves 
3-SAT in time O*(2"+™). We are going to show that for every d > 0 there 


472 14 Lower bounds based on the Exponential-Time Hypothesis 


exists an algorithm By that solves 3-SAT in time O*(2"), contradicting 
ETH. Fix any d > 0, and let K be the constant given by Theorem [14.3] for 
€ = d/2 and q = 3. Consider an algorithm 6, for the 3-SAT problem that 
works as follows: 


1. Apply the algorithm of Theorem [14.3]to the input formula y and param- 
eter € = d/2. 

2. Apply algorithm A, for d = HETI to each output formula y;, and 
return that ọ is satisfiable if at least one of the ys is. 


The correctness of the algorithm follows directly from Theorem [14.3] By 
Theorem the first step of the algorithm takes time O*(2%"), while the 


second takes time O*(2% - 220 (K+1)m) = O*(2%), as each formula Y; 


has at most Kn clauses and n variables. Hence, the whole algorithm runs in 
time O*(2"). 

















To see another application of the sparsification lemma, we will now show 
that SETH implies ETH. 


Theorem 14.5. If SETH holds, then so does ETH. 


Proof. For the sake of contradiction, assume that 63 = 0. Hence, for every 
c > 0 there exists an algorithm Ae that solves 3-SAT in time O*(2°"). We 
will show that this implies 6, = 0 for every q > 3, which contradicts SETH. 

Consider the following algorithm for g-SAT. Given the input formula y, 
first apply the algorithm of Theorem [14.3]for some € > 0, to be determined 
later. This algorithm runs in time O*(2*") and outputs at most 2°” formulas 
y, such that y is satisfiable if and only if any of the ys is, and each Y; has 
at most K(e,q)-n clauses. We now focus on resolving satisfiability of one 
Y = Yi- 

Consider the following standard reduction from q-SAT to 3-SAT: as long 
as there exists some clause C = 41 V 42 V ... V lp for some p > 4, add a fresh 
variable y and replace C with clauses Cy = 41 V42 Vy and Cz = 7yV03V...VEp. 
Observe that this transformation preserves satisfiability of the formula, and 
every original clause of length p > 3 gives rise to p — 3 new variables and 
p — 2 new clauses in the output formula. Hence, if we apply this reduction 
to the formula yw, we obtain an equivalent formula 7’ that is in 3-CNF and 
that has at most (1 + q- K(e,q))- n variables. If we now apply algorithm 
As to 7’, for some 0 < 6 < 1, then we resolve satisfiability of ~’ in time 
O*(2°") for ð = ô- (1 +q- K(e,q)). By applying this procedure to each 
of the listed formulas 4;, we resolve satisfiability of y in time O*(2°'") for 
6’ =e+0' =e+06-(1+q-K(e,q)). 

Recall that the constant £ in Theorem [14.3|can be chosen to be arbitrarily 
close to 0. Since 63 = 0, then having chosen £ > 0 (and thus K(e,q)) we 
can choose 6 so that 6’ is arbitrarily close to 0. Therefore, € and 6 can be 
chosen in such a manner that 6” is arbitrarily close to 0, which implies that 
bg = 0. 
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14.2 ETH and classical complexity 


The CNF-SAT and 3-SAT problems lie at the very foundations of the theory 
of NP-completeness. Problems around satisfiability of propositional formulas 
were the first problems whose NP-completeness has been settled, and the 
standard approach to prove NP-hardness of a given problem is to try to find 
a polynomial-time reduction from 3-SAT, or from some problem whose NP- 
hardness is already known. Thus, 3-SAT is in some sense the “mother of all 
NP-hard problems”, and the current knowledge about NP-completeness can 
be viewed as a net of reductions originating precisely at 3-SAT. Therefore, 
it should not be surprising that by making a stronger assumption about the 
complexity of 3-SAT, we can infer stronger corollaries about all the problems 
that can be reached via polynomial time reductions from 3-SAT. 

Consider, for instance, a problem A that admits a linear reduction from 
3-SAT, i.e., a polynomial-time algorithm that takes an instance of 3-SAT 
on n variables and m clauses, and outputs an equivalent instance of A whose 
size is bounded by O(n + m). Then, if A admitted an algorithm with run- 
ning time 2°), where |2| is the size of the input instance, then composing 
the reduction with such an algorithm would yield an algorithm for 3-SAT 
running in time 2°"+™), which contradicts ETH by Theorem Actu- 
ally, many known reductions for classic problems are in fact linear. Examples 
include VERTEX COVER, DOMINATING SET, FEEDBACK VERTEX SET, 3- 
COLORING, and HAMILTONIAN CYCLE; the reader is asked to verify this fact 
in Exercise [14-2] As a corollary we infer that none of these problems admits 
an algorithm running in subexponential time in terms of the instance size. 
Hence, while all of them can be solved in time O* (2%) for graphs on N ver- 
tices and M edges, the existence of algorithms with running time 2°) or 
even 2°) is unlikely] 


Theorem 14.6. Each of the following problems has a linear reduction from 
3-SAT: VERTEX COVER, DOMINATING SET, FEEDBACK VERTEX SET, 3- 
COLORING, HAMILTONIAN CYCLE. Therefore, unless ETH fails, none of 
them admits an algorithm working in time 2°N+™), where N and M are 
the cardinalities of the vertex and edge sets of the input graph, respectively. 


Of course, if the polynomial-time reduction from 3-SAT to the target 
problem has a different output size guarantee than linear, then the transferred 
lower bound on the complexity of the target problem is also different. The 
following basic observation can be used to transfer lower bounds between 
problems. 





3 In this chapter, we will consistently use n and m to denote the number of variables and 
clauses of a formula, respectively, whereas N and M will refer to the number of vertices 
and edges of a graph. 
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Observation 14.7. Suppose that there is a polynomial-time reduction from 
problem A to problem B that, given an instance x of A, constructs an equiv- 
alent instance of B having length at most g(|x|) for some nondecreasing func- 
tion g. Then an O*(2°(/(")))-time algorithm for B for some nondecreasing 
function f implies an O* (2° 9(¢))))-time algorithm for A. 


Therefore, in order to exclude an algorithm for a problem B with run- 
ning time O*(2°((121))), we need to provide a reduction from 3-SAT to 
B that outputs instances of size O(g(n + m)), where g is the inverse 


of f. 


Let us now make an example of an application of this framework where 
f and g are not linear. To this end, we will consider problems on planar 
graphs such as PLANAR VERTEX COVER, PLANAR DOMINATING SET, PLA- 
NAR FEEDBACK VERTEX SET, PLANAR 3-COLORING and PLANAR HAMIL- 
TONIAN CYCLE. The NP-hardness of these problems can be established by 
the means of a pivot problem called PLANAR 3-SAT. For an instance y of 
3-SAT, we can consider its incidence graph G, defined as follows: the vertex 
set of G, contains one vertex per each variable of y and one vertex per clause 
of p, and we put an edge between a variable x and a clause C if and only 
if z appears in C (either positively or negatively). In the PLANAR 3-SAT 
problem we require that the input formula of 3-SAT has a planar incidence 
graph. The following theorem, which we leave without a proof due to its 
technicality, provides a lower bound for PLANAR 3-SAT. 


Theorem 14.8 ([320]). There exists a polynomial-time reduction from 3- 
SAT to PLANAR 3-SAT that, for a given instance y with n variables and m 
clauses, outputs an equivalent instance vy! of PLANAR 3-SAT with O((n + 
m)?) variables and clauses. Consequently, PLANAR 3-SAT is NP-hard and 
does not admit an algorithm working in time 2°V"t+™ | unless ETH fails. 


We remark that the reduction of Theorem actually outputs a for- 
mula y’ and an ordering (£1, 22,..., En) of the variables of y’ such that Gys 
remains planar even after adding the edges x1 %2,%2%3,...,%n/—12n',Un/21- 
This property, called in what follows the existence of a variable circuit, turns 
out to be very useful in more complicated reductions. 

Roughly speaking, in the proof of Theorem one embeds the input 
formula arbitrarily on the plane, and then replaces each crossing of edges of 
Go by a constant-size crossover gadget that transfers the information via the 
crossing without disturbing planarity. Since the number of edges of G, is at 
most 3m, the number of crossings is at most 9m? and thus the bound on the 
size of the output formula follows. 

The gist of standard NP-hardness reductions for problems like VERTEX 
COVER, 3-COLORING or DOMINATING SET is to replace each variable x with 
a variable gadget H,, replace each clause C with a clause gadget Hc, and wire 
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the gadgets to simulate the behavior of the input instance of 3-SAT using 
the properties of the target problem. Due to Theorem we can use the 
same approach for planar problems as well: with a little bit of carefulness, 
replacing every variable and clause in a planar embedding of G, with a 
respective gadget will not disturb planarity. The reader is asked to verify the 
following theorem in Exercise [14.3] 


Theorem 14.9. Let A be any of the following problems: PLANAR VERTEX 
CovER, PLANAR DOMINATING SET, PLANAR FEEDBACK VERTEX SET. 
There exists a polynomial-time reduction that takes an instance p of PLANAR 
3-SAT with n variables and m clauses, and outputs an equivalent instance of 
A whose graph has O(n+m) vertices and edges. The same is true if A is PLA- 
NAR 3-COLORING or PLANAR HAMILTONIAN CYCLE, and y has a variable 
circuit. Consequently, unless ETH fails none of the aforementioned problems 
admits an algorithm with running time 20(V N), where N is the number of 
vertices of the input graph. 


Note, however, that all the problems considered in Theorem[14.9]can in fact 
be solved in time 20(W): this can be easily seen by pipelining the fact that a 
planar graph on N vertices has treewidth O(VN) (see Corollary |7.24}} with 
a single-exponential dynamic-programming routine on a tree decomposition 
(see Theorem and Theorem [11-13). Thus, we have obtained essentially 
matching upper and lower bounds on the complexity: achieving 20N) is 
possible, and it is hard to break this barrier. 


14.3 ETH and fixed-parameter tractable problems 


So far we have seen how ETH can be used to prove lower bounds on the 
classic complexity of NP-hard problems, i.e., complexity measured only in 
terms of the size of the input instance. We now proceed to the multivariate 
world: we shall present applications of ETH to parameterized problems, and 
in particular to problems that are fixed-parameter tractable. It turns out that 
ETH is a very convenient tool for proving lower bounds on the parameter 
dependence of the running time of FPT algorithms. In many cases these 
lower bounds (almost) match known algorithmic upper bounds. 





4 Note that Corollary not only proves an upper bound on treewidth, but also gives a 
polynomial-time algorithm which provides a desired decomposition. 
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14.3.1 Immediate consequences for parameterized 
complexity 


Observe that ETH and SETH can be seen as assumptions about the complex- 
ity of the g-SAT problems equipped with parameterization by the number of 
variables n. The sparsification lemma serves then as a reduction from param- 
eterization by n to parameterization by n+ m, and Theorem[14.4]establishes 
a lower bound for the latter. Therefore, if we consider 3-SAT as a source 
of parameterized reductions rather than classic ones, then we can infer lower 
bounds on the parameterized complexity of problems, instead of the standard 
instance-size complexity. 

Suppose, for instance, that a parameterized problem A admits a linear 
parameterized reduction from 3-SAT: a polynomial-time algorithm that takes 
an instance of 3-SAT on n variables and m clauses, and outputs an equivalent 
instance of A with parameter k bounded by O(n +m). If A admitted an FPT 
algorithm with running time O*(2°*)), then we could compose the reduction 
with this algorithm and thus solve 3-SAT in time 2°"+™, contradicting 
ETH by Theorem [14.4] Note here that we bound only the output parameter, 
while the whole output instance may have superlinear size. 

Again, for many classic FPT problems on general graphs, like VERTEX 
COVER or FEEDBACK VERTEX SET, the known NP-hardness reductions in 
fact yield linear parameterized reductions from 3-SAT. In the case of these 
two particular problems this conclusion is trivial: we already know that VER- 
TEX COVER and FEEDBACK VERTEX SET admit classical linear reductions 
from 3-SAT, and parameters in meaningful instances are bounded by the 
cardinality of the vertex set. This shows that both these problems, and any 
others that admit linear parameterized reductions from 3-SAT, probably do 
not admit subexponential parameterized algorithms, that is, FPT algorithms 
with running time O*(2°(*)). 

Similarly as before, we may prove lower bounds for different forms of FPT 
running time by having different guarantees on the output parameter. This 
is encapsulated in the following analogue of Observation 


Observation 14.10. Suppose that there is a polynomial-time parameterized 
reduction from problem A to problem B such that if the parameter of an 
instance of A is k, then the parameter of the constructed instance of B is 
at most g(k) for some nondecreasing function g. Then an O*(2°/*)))-time 
algorithm for B for some nondecreasing function f implies an O* (2°\/(9(*)))) 
algorithm for A. 


Therefore, in order to exclude an algorithm for a parameterized problem 
B with running time O*(2°\/*))), we need to provide a reduction from 
3-SAT to B that outputs instances with the parameter k bounded by 
O(g(n+m)), where g is the inverse of f. 
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Observe that formally a nondecreasing function f might not have an in- 
verse. For that reason throughout the chapter we slightly overuse the defini- 
tion of an inverse function, and for a nondecreasing function f its inverse is 
any function g satisfying g(f(x)) = O(2). 

Again, to see how this framework works in practice, consider the exam- 
ples of PLANAR VERTEX COVER, PLANAR DOMINATING SET, and PLANAR 
FEEDBACK VERTEX SET problems. By composing Theorems [14.8] and 
we obtain NP-hardness reductions for these problems that start with a 3- 
SAT instance with n variables and m clauses, and output instances of the 
target problem with at most O((n+m)?) vertices. Obviously, we can assume 
that the requested size of vertex cover /dominating set /feedback vertex set 
in the output instance is also bounded by O((n + m)?). Thus, following the 
notation of Observation these reductions can serve as parameterized 
reductions from 3-SAT for a quadratic function g. The following corollary is 
immediate. 


Theorem 14.11. Unless ETH fails, none of the following problems admits 
an FPT algorithm running in time O* (2°\V*)); PLANAR VERTEX COVER, 
PLANAR DOMINATING SET, PLANAR FEEDBACK VERTEX SET. 


mentioned in Theorem |14.11] that run in time O*(20(V*)) (in the cases 
of PLANAR VERTEX COVER and PLANAR DOMINATING SET) or in time 
O* (20(VFlesk)) (in the case of PLANAR FEEDBACK VERTEX SET), using the 
technique of bidimensionality (see Corollaries [7.30] and [7.31}. In fact, by us- 
ing the faster dynamic-programming routine for FEEDBACK VERTEX SET 
from Theorem we can solve PLANAR FEEDBACK VERTEX_SET also 
in time O*(2°(V")). Therefore, the lower bounds of Theorem [14.11] essen- 
tially match the upper bounds for all the considered problems. This shows 
that appearance of the square root in the exponent is not just an artifact 
of the technique of bidimensionality, but rather an intrinsic property of the 
problems’ complexity. 


Recall that in Chapter f we have given algorithms for all the problems 


*14.3.2 Slightly super-exponential parameterized 
complexity 


One class of problems, for which we have some understanding of the method- 
ology of proving sharp lower bounds, is the problems solvable in slightly super- 
exponential parameterized time, i.e., in time O* (20 !8*)) for the parameter 
k. The running time of such a form appears naturally in the following situa- 
tions: 
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(i) A seemingly optimal FPT algorithm is a branching procedure that 
performs O(k) guesses, each time choosing from a set of possibilities 
of cardinality KO“). An example of such algorithm is the one given 
for CLOSEST STRING in Section 

(ii) The parameter is the treewidth t of a given graph, and the natural 
dynamic-programming routine has 20('1°8*) states per bag of the 
tree decomposition, usually formed by partitions of the bag. An 
example of such an algorithm is the one given by Theorem [7.8] for 
the STEINER TREE problem. 


By excluding the existence of an O*(2°*!°8*)) algorithm for the prob- 


lem under ETH, we can support the claim that the obtained slightly super- 
exponential upper bound is not just a result of the approach taken, but such 
running time is necessary because of the nature of the problem itself. 

We now aim at providing a base set of auxiliary problems that will be help- 
ful for further reductions. At the end of this section we will see some examples 
of how they can be utilized for proving lower bounds for ‘real-life’ parameter- 
ized problems, say of type (i) or (ii) from the list above. The following problem 
will be our ‘canonical’ example where slightly super-exponential running time 
is both achievable and optimal under ETH. Intuitively, it reflects a generic 
situation with which we are faced when approaching a parameterized problem 
of type (i). 

In the k x k CLIQUE problem we are given a graph H on a vertex set 
[k] x [k]. In other words, the vertex set is formed by a k x k table, and 
vertices are of form (i,j) for 1 < i,j < k. The question is whether there 
exists a clique X in H that contains exactly one vertex from each row of the 
table, i.e., for each i € [k] there is exactly one element of X that has i on 
the first coordinate. Note that without loss of generality we may assume that 
each row of the table forms an independent set. 

Obviously, k x k CLIQUE can be solved in time O*(k*) by trying all possi- 
ble choices of vertices from the consecutive rows. It appears that this running 
time is essentially optimal assuming ETH, which is proved in the following 
theorem. In the proof, we will use the fact that the 3-COLORING problem 
admits a linear reduction from 3-SAT, and hence does not admit a subex- 
ponential algorithm in terms of N + M, unless ETH fails; see Theorem [14.6] 
and Exercise [14.2 


Theorem 14.12. Unless ETH fails, k x k CLIQUE cannot be solved in time 
O* (goth log k)), 


Proof. We shall present a polynomial-time reduction that takes a graph G on 
N vertices, and outputs an instance (H, k) of kxk CLIQUE such that (i) (H, k) 
is a yes-instance if and only if G is 3-colorable, and (ii) k = O(N/log N). 
Composing this reduction with an O*(2°(*!°8*)) algorithm for k x k CLIQUE 


14.3 ETH and fixed-parameter tractable problems 479 
would yield an algorithm for 3-COLORING working in time 2°? 
contradict ETH by Theorem [14.6] 

Let k = [ 24. ]. Arbitrarily partition the vertex set of the input graph into 


log; N 
k parts Vi, V2,...,Ve, so that each part V; has cardinality at most pee aj; 


For every part V;, list all the possible 3-colorings of G[V;]; note that there 
logg N 


are at most 31%! < 3l 3—1 < 3VN such colorings. If no coloring has been 
found for some part, we conclude that we are dealing with a no-instance. 
Observe also that without loss of generality we may assume that 3V N < k, 
since otherwise N is bounded by a universal constant and we may solve the 
input instance by brute-force. 

Duplicate some of the listed colorings if necessary, so that for each part 
V; we have a list of exactly k 3-colorings. Let c}, c?,...,c¥ be the 3-colorings 
listed for V;. For each i € [k] and j € [k], create a vertex (i, j). For every pair 
of vertices (i1, j1) and (i2, j2) make them adjacent if and only if i1 A i2 and 
ch U cœ}? is a valid 3-coloring on G[V;, U V;,]. This concludes the construction 
of the graph H, and of the output instance (H, k) of k x k CLIQUE. 

We now argue equivalence of the instances. If c is a valid 3-coloring of G, 
then cly, is a valid 3-coloring of G[V;]. Hence, for each i € [k], there exists 
an index j; € [k] such that c;, = cly,. By the construction, it follows that 
vertices (i, j;) for i € [k] form a k-clique in H: for each i1, i2 € [k], i1 Æ i2, 


, which would 











we have that ci U ie is equal to c 
of G[V;, U Vi]. 

Conversely, assume that a set {(i, ji) : i € [k]} induces a k-clique in H. 
We claim that c = Uj, ¢/' is a valid 3-coloring of G. Take any uv € E(G); 
we prove that c(u) # c(v). If u,v € V; for some i € [k], then we have 
c(u) # c(v) since c” = cly, is a valid 3-coloring of G[V;]. Suppose then that 
u € Vi, and v € Vi, for 41 A ig. Since (i1, ji) and (i2, jip) are adjacent in 
H, we have that oe U a? is a valid 3-coloring of G[V;, U V;,], and hence 


c(u) = cf (u) # fi? (v) = e(o). 


In the kxk CLIQUE problem the constraint is that each row of the table has 
to contain one vertex of the clique. It is natural to ask about the more robust 
version of this problem, where we additionally require that each column of the 
table also needs to contain one vertex of the clique, or, equivalently, that the 
vertices of the clique must induce a permutation of |k]. We call this problem 
k x k PERMUTATION CLIQUE. The following theorem provides a randomized 
lower bound for this version. 





V UV; and hence is a valid 3-coloring 














Theorem 14.13. Assume there exists an algorithm solving k x k PERMU- 
TATION CLIQUE in time O*(2°*!98*)). Then there exists a randomized algo- 
rithm solving k x k CLIQUE in time O*(2°"!8*)). The algorithm can have 
only false negatives, that is, on a no-instance it always gives a negative an- 
swer, whereas on a yes-instance it gives a positive answer with probability at 
least Z. 


480 14 Lower bounds based on the Exponential-Time Hypothesis 


Proof. Suppose that we are given an algorithm for k x k PERMUTATION 
CLIQUE that works in time O*(2°\*!°8*)). We are going to present a random- 
ized algorithm for k x k CLIQUE with the stated specification. 

Let (H, k) be the input instance of k x k CLIQUE. Consider the following 
algorithm: for each row i of the table, uniformly and independently at random 
pick a permutation 7; of [k], and permute the vertices of this row according 
to m; keeping the original adjacencies. More precisely, if we denote by H’ 
the obtained graph, then V(H’) = [k] x [k] and vertices (i, j) and (i, j’) are 
adjacent in H’ if and only if vertices (i, 7; '(j)) and (i’,7;,'(y’)) are adjacent 
in H. Apply the supposed algorithm for k x k PERMUTATION CLIQUE to the 
instance (H’,k); if this algorithm has found a solution to (H’,k), then return 
a positive answer, and otherwise provide a negative answer. 

Let m be the permutation of [k] x [k] that defines the graph H’, that 
is, m(i, j) = (i, m(j)). Obviously, if a solution X’ in the instance (H’,k) of 
k x k PERMUTATION CLIQUE has been found, then in particular 7~!(X’) is 
a solution to the input instance (H,k) of k x k CLIQUE. Therefore, if the 
algorithm returns a positive answer, then it is always correct. Assume now 
that the input instance (H, k) admits some solution X; we would like to show 
that with high probability the shuffled instance (H’,k) admits a clique that 
induces a permutation of [k]. Let X’ = 1(X). Obviously, X’ is still a clique in 
H’ and it contains one vertex from each row; we claim that with high enough 
probability it also contains one vertex from each column. 

Let fx and fx be functions from [k] to [k] such that (i, fx(i)) € X and 
(i, fx: (i)) € X’ for each i € [k]. We have that fx:(i) = m;(fx(i)) for each 
i € [k], and we need to provide a lower bound on the probability that fx 
is a permutation. However, by the choice of 7, for every index i € [k] the 
value of 7;(fx(i)) is chosen uniformly and independently at random among 
the elements of [k]. Hence, every function from [k]!*] is equally probable as 
fx’. Since there are k! permutations of [k], and k* functions in [k]!*! in total, 
we infer that x! 


Pr( fx. is a permutation) = a >e 


—k 


The last inequality follows from the known fact that k! > (k/e)*; recall that 
we used this inequality in the same manner in the proof of Lemma[f5.4] Hence, 
with probability at least e~* the instance (H’, k) will be a yes-instance of k x k 
PERMUTATION CLIQUE, and the algorithm will provide a positive answer. 
Using the standard technique of independent runs (see Chapter |5), we 
repeat the algorithm e” times in order to reduce the error probability from 
at most 1 — e~* to at most (1 —e7*)® < e-® "e" = 1/e < 1/2. Note that 


thus the whole procedure runs in time e* -O*(2°(*!98*)) = O* (20k log k)), 














The presented proof of Theorem [14.13] gives a randomized Turing reduc- 
tion from k x k CLIQUE to k x k PERMUTATION CLIQUE. It is therefore easy 
to see that the existence of an O*(2°\"!°8*)) algorithm for k x k PERMUTA- 
TION CLIQUE can be refuted under the stronger assumption of randomized 
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ETH; the reader is asked to verify this fact in Exercise [14.4] However, since 
the reduction of Theorem [14.13] is randomized, we cannot a priori state the 
same lower bound under deterministic ETH. Fortunately, the construction 
can be derandomized using perfect hash families, and thus we may obtain 
a deterministic Turing reduction. Hence, we have in fact the following lower 
bound; we omit its proof due to technicality. 


Theorem 14.14 ([326]). Unless ETH fails, k x k PERMUTATION CLIQUE 
cannot be solved in time O* (2° 108K) ), 


Let us now define the k x k HITTING SET problem: we are given a family 
F of subsets of [k] x [k], and we would like to find a set X, consisting of 
one vertex from each row, such that X N F 4 @ for each F € F. Again, in 
the permutation variant we ask for a set X that induces a permutation of 
[k]. Hardness for k x k (PERMUTATION) HITTING SET follows from an easy 
reduction from k x k (PERMUTATION) CLIQUE. 


Theorem 14.15. Unless ETH fails, k x k (PERMUTATION) HITTING SET 
cannot be solved in time O* (2° 8), 


Proof. Let (H,k) be an instance of kx k (PERMUTATION) CLIQUE. Construct 
an instance (F, k) of kx k (PERMUTATION) HITTING SET as follows: for each 
pair of nonadjacent vertices (i1, 71) and (i2, j2) where 71 Æ i2, introduce a set 
F € F that consists of all the vertices in rows 7; and i2 apart from vertices 
(i1, j1) and (i2, j2). Thus, any set X that contains one vertex from each row 
has a nonempty intersection with F if and only if both pairs (i1, j1) and 
(i2, j2) are not included in X simultaneously. It follows that any such X is 
a solution to the instance (F, k) of k x k (PERMUTATION) HITTING SET if 
and only if it is a solution to the instance (H,k) of k x k (PERMUTATION) 
CLIQUE, and so the instances (H,k) and (F,k) are equivalent. The theorem 


follows from an application of Theorems |14.12]and [14.14 


It turns out that for further reductions it is most convenient to have one 
more assumption about the k x k (PERMUTATION) HITTING SET problem. 
Namely, we say that an instance (F, k) of one of these problems uses only 
thin sets if each F € F contains at most one vertex from each row. Note that 
the instance obtained in the reduction of Theorem [14.15] does not have this 
property; however, hardness can be also obtained under this assumption. 














Theorem 14.16 ([326]). Unless ETH fails, kx k (PERMUTATION) HITTING 
SET WITH THIN SETS cannot be solved in time O* (20 !°8*)), 


The proof of Theorem [14.16] is a chain of three technical reductions, and 
we give it as Exercises [14.6]and 

As already mentioned in the beginning of this section, the presented prob- 
lems form a convenient base for further reductions. We now provide one such 
example. The problem of our interest will be the CLOSEST STRING problem, 
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Fig. 14.1: Example of the reduction of Theorem [14.17] applied to an instance 
of 5 x 5 PERMUTATION HITTING SET WITH THIN SETS that has three sets, 
with each set containing at most one element from each row. The elements 
of these sets are denoted by circles, squares, and hexagons, respectively 


studied in Section In this problem, we are given a finite alphabet X, a 
set of strings {£1, £2,..., £n} over X, each of length L, and an integer d. The 
question is whether there exists a string y € ©” such that x; and y differ on 
at most d positions, for each 7. In Section [3.5] we have shown how to solve 
CLOSEST STRING in time O*((d + 1)“) whereas Exercise asked for an 
analysis of an O*(c“|S)|“)-time algorithm for some constant c. We can now 
show that both these running times are essentially optimal. 


Theorem 14.17. Unless ETH fails, CLOSEST STRING cannot be solved in 
O* (204184) nor O* (2°48 1~1)) time. 


Proof. We provide a polynomial-time algorithm that takes an instance (F, k) 
of k x k PERMUTATION HITTING SET WITH THIN SETS, and outputs an 
equivalent instance (27, d,{1,¥2,...,%n}) of CLOSEST STRING with L = k, 
d= k—1 and |X| =k+1. If there existed an algorithm for CLOSEST STRING 
with running time O*(2°¢!°8%) or O*(2°(¢!8|*1)), then pipelining the re- 
duction with this algorithm would give an algorithm for k x k PERMUTATION 
HITTING SET WITH THIN SETS with running time O*(22(*!°8*)), contradict- 
ing ETH by Theorem 

Let X = [k] U{%} for some additional symbol x that the reader may view 
as ‘mismatch’. First, introduce k strings x1, %2,..., £p such that x; = if, i.e., 
symbol i repeated k times. Then, for every F € F introduce a string zp 
constructed as follows. For each i € [k], if F contains some pair (i, j;) in row 
i, then put j; on the i-th position of zp. Otherwise, if no such pair exists, put 
x on the i-th position of zr. Note that this definition is valid due to all the 
sets being thin. Finally, we set d = k — 1 and conclude the construction; see 


Fig. for a small example. 
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We now formally argue that instances (F,k) and (X,k — 1,{x; : i€ 
[k]}U{zp : F € F}) are equivalent. Assume first that X is a solution to the 
instance (F, k). For each i € [k], let j; € [k] be the unique index such that 
(i, ji) E€ X. Construct a string y by putting ji on the i-th position of y for 
each i € [k]. Note that since X induces a permutation of [k], then for every 
j € [k] there exists i € [k] such that j = ji, and so y and a; differ on at most 
k — 1 positions. Moreover, since X N F is nonempty for each set F € F, we 
have that y coincides with zp on at least one position, so it differs on at most 
k — 1 positions. Hence y is a solution to the constructed instance of CLOSEST 
STRING. 

Conversely, take any solution y of the constructed CLOSEST STRING in- 
stance. For each j, we have that x; and y differ on at most k—1 positions, so 
they coincide on at least one. Therefore, y needs to contain every j € [k] at 
least once. Since |y| = k, then we infer that y contains every j € [k] exactly 
once, and it does not contain any w. Let X = {(i,y[z]) : i € [k]}. We already 
know that X C [k] x [k] and that X induces a permutation of [k]. We are 
left with proving that X has a nonempty intersection with every F € F. We 
know that y and zp coincide on at least one position, say i. As y does not 
contain any x, we have that zpļi] = yli] = ji for some j; € [k]. Consequently 
(i, ji) € X and (i, ji) € F by the definitions of X and zp, so XN F #90. 














Another important problem that roughly fits into the same category as 
CLOSEST STRING is DISTORTION, defined as follows. We are given a graph G 
and an integer d. The task is to find an embedding 7 of the vertices of G into 
Z such that dist(u, v) < |n(u)—n(v)| < d-dist(u, v) for all u,v € V(G), where 
dist(u, v) is the distance between u and v in G. In other words, one is asked to 
embed the graph metric into the linear structure of Z, so that the distances 
are stretched at most d times. The problem of embedding various (graph) 
metrics into low-dimensional vector spaces is the subject of intensive studies 
both from a purely mathematical, and from a computational point of view; 
DISTORTION is thus the most basic and fundamental problem in this theory. 
Surprisingly, DISTORTION is fixed-parameter tractable when parameterized 
by the stretch d, and it admits an O*(d?) algorithm. Using the methodology 
introduced in this section one can prove that this is optimal under ETH; we 
omit the proof. 


Theorem 14.18 ([176) ). DISTORTION can be solved in time O*(d*). 
Moreover, unless ETH fails there is no O* (20(4 10g d) algorithm for this prob- 
lem. 


Recall that another class of problems where the complexity of the form 
O*(20@los8t)) appears naturally, are treewidth t parameterizations where the 
natural space of states of the dynamic-programming routine is formed by 
partitions of the bag. In Chapter|7| we have presented one such dynamic pro- 
gram running in time O*(20(¢!89), for the STEINER TREE problem. While 
in Chapter [11] we have seen that STEINER TREE can be solved faster, in time 
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O* (2°), for some other problems the presented framework can be used to 
show that the running time of O*(20(!°8")) is essentially optimal. The next 
theorem, which we leave without a proof, provides some examples. 


Theorem 14.19 ([118, [326]). Unless ETH fails, the following problems 
cannot be solved in time O* (200 108 8), where t is the width of a given path 
decomposition of the graph: CYCLE PACKING, VERTEX DISJOINT PATHS. 


Recall that in the CYCLE PACKING problem we want to decide whether a 
given graph has k vertex-disjoint cycles, while in VERTEX DISJOINT PATHS 
given k pairs of vertices the question is whether one can connect those pairs 
by vertex-disjoint paths. Note that Theorem [14-19]is stated in terms of a path 
decomposition and not a tree decomposition, which makes the lower bound 
stronger. 

The idea behind the proof of Theorem [14.19]is to reduce from k x k (PER- 
MUTATION) HITTING SET WITH THIN SETS and try to embed the search space 
of this problem, which has size roughly k*, into the states of the standard 
dynamic program working on a path decomposition. We create a graph which 
has a long path decomposition of width O(k) and using problem-dependent 
combinatorics we make sure that the choice the solution makes on the first 
bag (for instance, a choice of a matching for the CYCLE PACKING problem) 
is propagated along the decomposition to all the other bags. This propagated 
choice has to reflect the intended solution of the input k x k (PERMUTATION) 
HITTING SET WITH THIN SETS instance. Then we attach multiple gadgets 
along the path decomposition, one for every set to be hit. Each gadget should 
have O(k) pathwidth, and its role is to verify that the propagated solution 
from the first bag indeed hits the corresponding set. Thus, all the gadgets are 
satisfied if and only if the chosen and propagated solution hits all the sets. We 
will look more closely on reductions of this type in Section where we 
discuss optimality of bounded-treewidth dynamic programming under SETH. 


In essence, the outcoming message is that the nature of problems ex- 
emplified by those mentioned in Theorem [14.19] makes it necessary to 
have 2°%(¢les*) states per bag of the decomposition. The space of states 
cannot be pruned substantially, since each state is, roughly speaking, 
meaningful on its own. 


*14.3.3 Double exponential parameterized complexity 


Recall that in Section we have considered the EDGE CLIQUE COVER 
problem: given a graph G and an integer k, verify whether G contains k 
subgraphs C1, C2,...,C, such that each C; is complete and Ui E(C;) = 
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E(G). Already very simple reduction rules yield a kernel for EDGE CLIQUE 
Cover with at most 2” vertices, and the problem can be solved in time 
O* (22°) by performing SET Cover-like dynamic programming on the edge 
set of the kernel, similar to the one of Section In the light of our lower 
bound methodology, it is natural to ask whether such running time is optimal 
under ETH. This appears to be the case. 


Theorem 14.20 ([120]). There exists a polynomial-time algorithm that, 
given a 3-SAT instance p on n variables and m clauses, constructs an 
equivalent EDGE CLIQUE COVER instance (G,k) with k = O(logn) and 
|V(G)| = O(n + m). Consequently, EDGE CLIQUE COVER cannot be solved 


in time O*(2?°) unless ETH fails. 


The proof of Theorem |14.20]is a technical reduction that we omit in this 
14.20] 


book. Again, Theorem |14.20|shows that, despite the simplicity of the kernel- 
ization approach, nothing substantially better can be achieved. The reason 
for this lies not in the naivety of the technique, but in the problem itself. 


14.4 ETH and W[1]-hard problems 


In Chapter [13] we have seen techniques for giving evidence that a problem is 
not fixed-parameter tractable, that is, the parameter k has to appear in the 
exponent of the running time. But we have not explored the question of how 
exactly the exponent has to depend on k for a W[1]-hard problem: for all we 
know, it is possible that a W[1]-hard problem has a 20(*) . nO (og leg log k)_time 
algorithm, which would be “morally equivalent” to the problem being FPT. 
In Section[14.3] we have seen how ETH can be used to classify FPT problems 
by giving (often tight) lower bounds on the function f(k) appearing in the 
running time. It turns out that ETH also allows us to prove (often tight) 
lower bounds on the asymptotic behavior of the function g in an f(k)n9™) 
time algorithm. The first such lower bound result we present is for CLIQUE 
(or, equivalently, for INDEPENDENT SET). 


Theorem 14.21. Assuming ETH, there is no f(k)n° -time algorithm for 
CLIQUE or INDEPENDENT SET for any computable function f. 


Proof. We will show that if there is an f(k)n°\")-time algorithm for CLIQUE 
for some computable function f, then ETH fails. Suppose that CLIQUE on 
every graph H can be solved in time f(k)|V(H)|*/8™, where s(k) is a nonde- 
creasing unbounded function. We use this algorithm to solve 3-COLORING on 
an n-vertex graph G in time 2°”; by Theorem [14.6] this contradicts ETH. 
The reduction is similar to the proof of Theorem [14.12] but instead of split- 
ting the vertices into k = O(n/logn) groups, the number k of groups now 
depends on the function f. 
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We split the vertices in the 3-COLORING instance into roughly k = 
f~t (n) groups, where f~! is the inverse of f (e.g., if f(k) = 2°, then 
f=! (n) = @(logn)). We create a CLIQUE instance where each vertex 
represents a 3-coloring of one of the groups. The created graph H has 
size at most k-3"/* = k-3"/f-"(™) = 2°"), We can observe that the 
running time of the assumed f(k)|V(H)|°“)-time algorithm for CLIQUE 
on H is 2°"): we have f(k) = f(f—!(n)) < n and |V(H)|°™) is roughly 
(sere) 2 go0(n). 


We need to be very careful when defining the number k of groups we need; 
one reason is that we have no assumption on the time needed to compute 
f(k), besides the fact that it is computable. We may assume that the com- 
putable function f satisfies f(k) > max{k,k*/*} for every k > 1 (e.g., by 
replacing it with the computable function f'(k) = max{ f(k), k, k*/s} if 
necessary). Let us start computing the values f(k) for k = 1,2,...; we stop 
this computation when either a k with f(k) > n is reached, or when the total 
number of steps used so far exceeds n. Let k be the last value for which we 
computed f(k) and we have f(k) < n (we may assume that f(1) < n and 
we can compute f(1) in n steps, otherwise n is bounded by a universal con- 
stant). The value k selected this way is a function of n only, that is, we have 
computed the value k = g(n) for some computable function g(n). Moreover, 
computing k = g(n) took only time O(n). Note that from the way function 
g is defined it follows that it is nondecreasing, unbounded, and f(g(n)) < n. 
Moreover, the assumption f(k) > k and the inequality f(g(n)) < n imply 
that g(n) < n. 

Split the vertices of G into k groups V1, ..., Vk of size at most [n/k] each. 
Let us build a graph H where each vertex corresponds to a proper 3-coloring 
of one of the groups. The number of vertices in H is at most 


|V(H)| < k- gink] < k. 320/k) — p. 32(n/9(n)) — g0(n) 


(we used that k = g(n) < n in the second inequality and the fact that g(n) is 
nondecreasing and unbounded in the last equality). We connect two vertices 
of H if they are not conflicting. That is, if u represents a coloring of G[V;] 
and v represents a coloring of G[V;] for some i # j, then u and v are adjacent 
only if the union of the colorings corresponding to u and v is a valid coloring 
of G[V; UV;]. It is easy to verify that a k-clique of H corresponds to a proper 
3-coloring of G and one can construct this coloring in time polynomial in the 
size of H, that is, in time 2°”). Therefore, using the assumed algorithm for 
CLIQUE, a 3-coloring of G can be found in time 
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FIV (ENEO < nfk -31/81/20 
n(k - 32r/#)k/s(h) 


using f(k) = f(g(n)) < n) 
using k = g(n) < n) 


IA 


s(k) is nondecreasing) 


using k*/8™) < f(k) <n) 


n2 . 32n/s(g(n)) 
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( 
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<n K/S . 32n/sa(n)) 
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oe), function s(g(n)) is 


nondecreasing and unbounded) 


Therefore, we have shown that the assumed algorithm would contradict ETH. 














In particular, Theorem |14.21|shows that, assuming ETH, CLIQUE is not 
FPT. Therefore, Theorem |14.21|shows that ETH implies that FPT 4 W[1 


Having established a tight lower bound for CLIQUE, we can easily transfer 
it to other problems with parameterized reductions. As in Observation [14.10] 
the strength of the lower bound on the target problem depends on the bound 
on the new parameter in the reduction. 


Observation 14.22. Suppose that there is a polynomial-time parameterized 
reduction from problem A to problem B such that if the parameter of an 
instance of A is k, then the parameter of the constructed instance of B is 
at most g(k) for some nondecreasing function g. Then an f(k) -n°("))-time 
algorithm for B for some computable function f and nondecreasing function 
h implies an f(g(k)) - n°0) algorithm for A. 


Therefore, in order to exclude an algorithm for a parameterized problem 
B with running time f(k): n°) for any computable function f, we 
need to provide a reduction from CLIQUE to B that outputs instances 
with the parameter k bounded by O(g(k)), where g is the inverse of h. 


Observing that most of the parameterized reductions in Chapter [13] have 
linear parameter dependence, we can obtain tight lower bounds for a number 
of problems. 


Corollary 14.23. Assuming ETH, there is no f(k)n° (k) time algorithm for 
the following problems for any computable function f: 


CLIQUE on regular graphs, 

INDEPENDENT SET on regular graphs, 
MULTICOLORED CLIQUE on regular graphs, 
MULTICOLORED INDEPENDENT SET on regular graphs, 
DOMINATING SET, 

DOMINATING SET ON TOURNAMENTS, 

CONNECTED DOMINATING SET, 
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e BALANCED VERTEX SEPARATOR parameterized by the size k of the solu- 
tion, 
e LIST COLORING parameterized by the treewidth k. 


Let us point out the surprising fact that we have the same lower bound 
here for DOMINATING SET and DOMINATING SET ON TOURNAMENTS (the 
exponent of n cannot be o(k)), despite the significant difference in complexity 
for the unparameterized versions: only 2°(”)-time algorithms are known for 
DOMINATING SET and they are optimal under ETH (see Theorem|14.6), while 
DOMINATING SET ON TOURNAMENTS can be solved in time n? 087). From 
the viewpoint of Observation all that matters is that the reduction 
of Theorem [13.14] has the property that the new parameter is linear in the 
original parameter. The double-exponential dependence on k in the running 
time of the reduction does not affect the quality of the lower bound obtained 
in terms of the exponent of n. 

There are W[1]-hard problems for which it seems hard to obtain tight lower 
bounds using a parameterized reduction from CLIQUE and Observation [14.23] 
we need new technology to make the lower bounds tighter for these problems. 
Consider, for example, the ODD SET problem. In Theorem we have 
presented a parameterized reduction from MULTICOLORED CLIQUE to ODD 
SET. A crucial idea of the reduction was to represent both the vertices and the 
edges of the k-clique in the ODD SET instance, and this was the reason why 
the parameter of the constructed instance was k + (4) = O(k?). Therefore, 
Observation [14.2] with g(k) = O(k?) and h(k) = Vk implies that, assuming 
ETH, there is no f(k)n°\Y*)-time algorithm for ODD SET for any computable 
function f. The naive algorithm solves the problem in time nO) by brute 
force, giving a significant gap between the upper and the lower bound. As we 
have discussed in Section [13.6.3] it seems difficult to design a reduction from 
CLIQUE to ODD SET in which we represent only the vertices of the k-clique, 
and representing the (5) edges of the k-clique inevitably leads to a quadratic 
blow-up of the parameter. 

A key idea for making these reductions from CLIQUE tighter is to interpret 
them as reductions from SUBGRAPH ISOMORPHISM instead. In the SUBGRAPH 
ISOMORPHISM problem, we are given two graphs H and G; the task is to de- 
cide if H is isomorphic to a (not necessarily induced) subgraph of G. Param- 
eterized reductions from CLIQUE where vertices and edges are represented by 
gadgets (e.g, Theorem [13.31) can be usually generalized to reductions from 
SUBGRAPH ISOMORPHISM with minimal modifications. As finding a k-clique 
is a special case of SUBGRAPH ISOMORPHISM, Theorem [14.21] implies that, 
assuming ETH, there is no f(k)n°) time algorithm for SUBGRAPH ISO- 
MORPHISM for any computable function f, where k is the number of edges of 
the smaller graph H. The following result, whose proof is beyond the scope 
of this chapter, gives a lower bound parameterized by the number of edges 
of H: 
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Theorem 14.24 ([352]). Assuming ETH, there is no f(k)n°*/!8*)-time 
algorithm for SUBGRAPH ISOMORPHISM, where k is the number of edges of 
the smaller graph H and f is a computable function. 


We remark that it is an interesting open question whether the factor log k in 
the exponent can be removed, making this result tight. We can modify the 
reduction of Theorem[13.31|to be a reduction from SUBGRAPH ISOMORPHISM 
to ODD SET, where the new parameter is linear in the number of edges of the 


smaller graph H (Exercise}14.11). Then Theorem]|14.24]and Observation|14.22 


give a tighter lower bound, almost matching the trivial nNO”)-time algorithm. 


Theorem 14.25. Assuming ETH, there is no f(k)n°*/1°8") -time algorithm 
for ODD SET. 


Similarly, one can modify the reduction in the proof of Theorem 13.33] to 
get tighter lower bounds for STRONGLY CONNECTED STEINER SUBGRAPH 


(Exercise (14-19). 


Theorem 14.26. Assuming ETH, there is no f(k)n°*/!°8") -time algorithm 
for STRONGLY CONNECTED STEINER SUBGRAPH. 


CLOSEST SUBSTRING (a generalization of CLOSEST STRING) is an extreme 
example where reductions increase the parameter exponentially or even dou- 
ble exponentially, and therefore we obtain very weak lower bounds. In this 
problem, the input consists of strings s1, ..., s; over an alphabet X, and in- 
tegers L and d. The task is to find a string s of length L such that, for every 
1<i<t, si has a substring s; of length L with dy(s, si) < d. Here dy (x,y) is 
the Hamming-distance of two strings, that is, the number of positions where 
they differ. 

Let us restrict our attention to the case where the alphabet is of constant 
size, say binary. There is a (very involved) reduction from CLIQUE (with k 
being the size of the clique we are looking for) to CLOSEST SUBSTRING where 
d= 20) and t = 22° in the constructed instance. Therefore, we get weak 
lower bounds with only o(log d) and o(log log t) in the exponent. Interestingly, 
these lower bounds are actually tight, as there are algorithms matching them. 


Theorem 14.27 ([349]). CLOSEST SUBSTRING over an alphabet of constant 
size can be solved in time f(d)n©0°8® or in time f(t, d)nC°8!8, Further- 
more, assuming ETH, there are no algorithms for the problem with running 
time f(t, d)n°°8® or f(t,d)n°%°8s for any computable function f. 


14.4.1 Planar and geometric problems 


We have seen in earlier chapters that many parameterized problems are fixed- 
parameter tractable when restricted to planar graphs. There are powerful 
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Fig. 14.2: An instance of Grip TILING with k = 3 and n = 5. The red pairs 
form a solution 


techniques available for attacking planar problems, for example, the shift- 
ing strategy (Section [7.7.3) or bidimensionality (Section [7.7.2). Nevertheless, 
there exist natural parameterized problems that are W([1|-hard on planar 
graphs. In this section, we develop a convenient technology for proving the 
W[1]-hardness of these problems. Moreover, assuming ETH, this technology 
allows us to prove that there is no f(k) - novk) algorithm for the problem 
for any computable function f. This is very often matched by a correspond- 
ing algorithm, giving us tight understanding of the complexity of a planar 
problem. 

The key tool of the section is the following (somewhat artificial) problem, 
which will be a convenient starting point for reductions to planar problems. 
The input of GRID TILING consists of an integer k, an integer n, and a 
collection S of k? nonempty sets Sij C [n] x [n] (1 < i,j < k). The task is 
to find, for each 1 < i,j < k, a pair sij E Sij such that 


e If s;j = (a,b) and 84,5 = (a',b'), then a =a’. 
e If s;j = (a,b) and si j+1 = (a’, 0’), then b = b. 


In other words, if (i, j) and (2’, 7’) are adjacent in the first or second coordi- 
nate, then si; and sj, agree in the first or second coordinate, respectively. 
An instance of GRID TILING with a solution is shown in Fig. [14.2] we imagine 
Si j to be in row 7 and column j of a matrix. Observe that the constraints 
ensure that the first coordinate of the solution is the same in each column and 
the second coordinate is the same in each row. We can prove the hardness of 
GRID TILING by a simple reduction from CLIQUE. 


Theorem 14.28. GRID TILING is W[1]-hard and, unless ETH fails, it has 
no f(k)n°™ -time algorithm for any computable function f. 
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Proof. The proof is by reduction from CLIQUE. Let (G,k) be an instance of 
CLIQUE. Let n = |V(G)|; we assume that the vertices of G are the integers 
in [n]. We construct an instance of GRID TILING as follows. For every 1 < 
i,j < k, we define 
g, {(a,a) : l<a<n} if i=, 
‘J  \{(a,b) : a# band vertices a and b are adjacent in G} if i Æj. 
We claim that the constructed instance of GRID TILING has a solution if 
and only if G contains a k-clique. First, suppose that vertices v1, ..., vz are 
distinct and form a clique in G. Then it is easy to verify that sij = (vj, vi) 
(note that the order of i and j is swapped in the indiceq*) is a solution of GRID 
TILING. Conversely, suppose that the pairs s;,; (1 < i,j < k) form a solution. 
As si j and s;+1,; agree in the first coordinate and s; j and s; j+1 agree in the 
second coordinate, it follows that there are values v1, ..., Uk, Uj, ---, Up Such 
that s;; = (vj, vi) for every 1 < i,j < k. As Sii = {(a,a) : 1<a<n}, 
this is only possible if v; = v; for every 1 <i < k. It follows that v; and vj 
are two adjacent (distinct) vertices: as (vj, vi) = (uj, vi) € Sij, the definition 
of Si j implies that vjv; is an edge of G. Therefore, v1, ..., vg is indeed a 
k-clique in G. 














There are two aspects of GRID TILING that make it particularly suited 
for reduction to planar problems: 


1. If the sets are represented by gadgets, then each gadget has to 
interact only with the gadgets representing adjacent sets, making 
the high-level structure of the constructed instance planar. 

2. The constraint that needs to be enforced on adjacent gadgets is 
fairly simple: equality of the first /second coordinate of the value. 


The following reduction demonstrates the first aspect of GRID TILING. In 
Section [13.6] we have shown, by a reduction from MULTICOLORED CLIQUE, 
that List COLORING is W[1]-hard parameterized by treewidth. Furthermore, 
in gga EE we have observed that the reduction actually shows that it 
has no f(t)n°“-time algorithm for any computable function f, unless ETH 
fails. As the reduction in Theorem was from MULTICOLORED CLIQUE, 
the created instance is highly non-planar (see Fig. (13.5), being essentially a 
subdivided clique. On the other hand, when we reduce from GRID TILING, 
the resulting graph is grid-like, and hence planar. 





5 We could have defined Grip TILING by swapping the meaning of the coordinates of sij, 
that is, s;,; and s;,;41 have to agree in the first coordinate. Then we would have avoided 
the unnatural swap of indices in this proof. However, as we shall see, in many proofs (e.g., 


Theorems |14.32]and|14.34), the current definition is much more convenient. 
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Fig. 14.3: The graph G” of the List COLORING instance constructed in the 
reduction of Theorem |14.29|for k = 4 


Theorem 14.29. LIST COLORING on planar graphs parameterized by the 
treewidth t is W[1]-hard. Moreover, it has no f(t)n°® -time algorithm for 
any computable function f, unless ETH fails. 


Proof. We present a parameterized reduction from GRID TILING. Let (n, k, S) 
be an instance of GRID TILING. The set C of colors will correspond to [n] x [n]; 
then every set S; j can be interpreted as a set of colors. We construct a graph 
G” the following way (see Fig. 14.3). 


i) For every 1 < i,j < k, we introduce a vertex u;; with L(u;;) = S;.;. 
J J J J 
(ii) For every 1 < i < k, 1 < j < k and for every pair (a,b), (a’,b’) € 
[n] x [n] with a # a’, we introduce a vertex ;,j,(a,b),(a’,b/) Whose list is 
{(a, b), (a’, b’)} and make it adjacent to u; j and ui+1,;- 
(iii) For every 1 < i < k, 1 < j < k and for every pair (a,b), (a',b') € 
[n] x [n] with b Æ b’, we introduce a vertex hi; j (a,b),(a’,’) Whose list is 
{(a, b), (a’, b’)} and make it adjacent to u; j and ui j+1- 


To bound the treewidth of G’, observe that every vertex other than the 
vertices u; į; has degree 2. Suppressing degree-2 vertices of G’ does not change 
its treewidth (see Exercise|7.13), assuming its treewidth is at least 3, and gives 
a k x k-grid with parallel edges, which has treewidth k. 

Suppose that {s;;j € Sij : 1<%,j < k} isa solution of the Grip TILING 
instance; let s;,; = (a;,;,0;,;). We can construct the following list coloring 
of G”. First, we set the color of u; j to sij E Sij = L(ui j). Then for every 
vertex U;,;,(a,b),(a’,b’) it is true that either u; j has a color different from (a, b), 
or Uj+1,j has a color different from (a’,b’) (as aij = ai+1,;, while a # a’). 
Therefore, one of the two colors appearing in the list of v; j (a,b),(a’,p’) 18 not 
used on its neighbors, hence we can extend the coloring to this vertex. The 
argument is similar for the vertices hj; (a,b),(a’,b’)- 
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For the reverse direction, suppose that c : V(G’) — [n] x [n] is a list 
coloring of G’. We claim that the pairs s;,; = c(uj;) € L(uij) = Sij form 
a solution for the GRID TILING instance. For a contradiction, suppose that 
Si j = (a,b) and s;41,; = (a’,b’) do not agree in the first coordinate, that is, 
a # a’. Then the vertex vi j,(a,b),(a’,b’) exists in G and has list {(a, b), (a’, b')}- 
However, this vertex is adjacent to both u; j and ui+1,j, which have colors 
(a,b) and (a’, 0’) in the coloring; hence v;,;,(a,b),(a’,p’) cannot have either color 
on its list, a contradiction. The argument is similar in the case when s; j 
and s;,;41 do not agree in the second coordinate; in this case, we arrive to a 
contradiction by observing that both colors in the list vertex hi j (a,b),(a',b') 
are already used by its neighbors. 














The second reason why GRID TILING is a useful starting point for reduc- 
tions to planar problems is that the constraints between adjacent gadgets 
are very simple: it requires that the first /second coordinates of the values of 
the two gadgets are equal. This can be much simpler than testing adjacency 
of two vertices in a reduction from MULTICOLORED CLIQUE and is similar 
to the projection constraint between edge gadgets and vertex gadgets (see 
Theorem 13.31). In Theorem the List COLORING problem was very 
powerful and it was easy to express arbitrary binary relations, hence this 
aspect of GRID TILING was not exploited. 

There are reductions, however, where it does matter that the constraints 
between the gadgets are simple. We introduce a variant of GRID TILING 
where the constraints are even more suited for planar problems. The input of 
GRID TILING WITH < is the same as the input of GRID TILING, but we have 
less than or equal constraints between the first /second coordinates instead of 
equality (see Fig. [14.4p. That is, the task is to find, for each 1 < i,j <k,a 
value s; j € Sj; such that 


e If si; = (a,b) and s;41,; = (a',b'), then a < a’. 
e If s;;j = (a,b) and si j+1 = (a',b'), then b < b. 


This variant of GRID TILING is very convenient when we want to prove 
lower bounds for a problem involving planar/geometric objects or distances 
between points: the < constraints have natural interpretations in such prob- 
lems. 

We show first that the lower bounds of Theorem hold for GRID 
TILING WITH < as well. 


Theorem 14.30. GRID TILING WITH < is W[1]-hard and, unless ETH fails, 
it has no f(k)n°™ -time algorithm for any computable function f. 


Proof. Given an instance (n, k, S) of GRID TILING, we construct an equiva- 
lent instance (n’,k’,S’) of GRID TILING WITH < with n’ = 3n? (k+1)+n?+n 
and k’ = 4k. For every set S; j of the input instance, there are 16 correspond- 
ing sets St, „ (46-3 < i' < 4i, 4j — 3 < 7’ < 4j) in the constructed instance 
(see Fig. 14.5). We call these sets the gadget representing Si j. The four inner 
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S11: Sy 9: 51,3: 
(1,1) (5,1) (1,1) 
(3,1) (1,4) (2,5) 
(2,4) (5,3) (3,3) 
S21: S22: S23: 
(2,2) (3,1) (3,2) 
(1,4) (2,2) (2,3) 
S31: S39: S33: 
(1,3) (1,1) (5,4) 
(2,3) (2,3) (3,4) 
(3,3) 

















Fig. 14.4: An instance of GRID TILING WITH < with k = 3 and n = 5. The 
red pairs form a solution 


sets S4i 2,4j—2> S4; 2,4j—1; S4i 1,4j-2> S4i 1,4j—1 are dummy sets containing 
only the single pair shown in the figure. The 12 outer sets are defined as 
follows. We define the mapping (a,b) = na + b and let N = 3n?. For every 
(a,b) € Si j, we let z = (a, b) and introduce the pairs shown in Fig. [14.5]into 
the 12 outer sets. It can be easily verified that both coordinates of each pair 
are positive and at most n’. This concludes the description of the constructed 
instance. 





The intuition behind the construction is the following. Selecting a pair 
from one of the 12 outer sets in a gadget selects a pair from S; j. We 
show that in every solution, a “vortex” on the 12 outer sets ensures that 
each of them select the same pair si j € Si j- Then we show that the in- 
teraction of the outer sets of the gadgets enforce exactly the constraints 
of the GRID TILING problem. For example, the second set of the last 
row of the gadget representing S; j and the second set of the first row of 
the gadget representing S;41,; interact in a way that ensures that the 
first coordinate of s; j is at most the first coordinate of s;41,;, while the 
third sets of these rows ensure the reverse inequality, implying that the 
two first coordinates are equal. 


Let sij € Sij for 1 < i,j < k be a solution of the GRID TILING instance. 
For every si; = (a,b), we select the corresponding pairs from the 16 sets in 
the gadget of S; j for z = (a,b), as shown in Fig. First, it is easy to 
verify that the constraints are satisfied between the sets of the same gadget. 
Consider now the last row of the gadget of Si; and the first row of the 
gadget of Si+1,j- For the first sets in these rows, the constraints are satisfied: 


14.4 ETH and W[1]-hard problems 495 








Sii—3,45-3! Sti—3,4j—2! Sui—3,4j-1! Shi—3,45' 
(iN — z, jN +2) (iN +a, jN +z) (iN —a,jN +z) (GN +2,jN +2) 
Shi2,4j—-3" Shi—2,4j—2' Shi—2,4j—1' Shi—2,4j' 


UN-2,jN+0) |(G+DN,G+IN)| (N,G+IN) |GN+z,(j+1)N +b) 








Sji—1,4j—3} Sji—1,4j—2' Shi—1,4j—1° Shi 1,45: 
(iN — z,jN — b) ((@+1)N,jN) (iN,jN) (iN +z, (7 +1)N—b) 
Shi,aj—3! Shi aj—2! Shisj—1' Shi aj: 


(GN -— z, jN- z) |(i+1)N+a,jN—z)|(i+1)N-a,jN—z)} (iN+z,jN—z) 

















Fig. 14.5: The 16 sets of the constructed GRID TILING WITH < instance 
representing a set S; j of the GRID TILING in the reduction in the proof of 


Theorem |14.30} together with the pairs corresponding to a pair (a,b) € Sj, 
(with z = (a, 6)) 


the pair selected from S4; 4j—3 has first coordinate less than iN, while the 
pair selected from $4(;41)~3.4;-3 = S4i+1,4j—3 has the first coordinate at least 
(i+1)N — (n? +n) > iN. Similarly, there is no conflict between the last sets 
of these rows. If aij = ai41,; are the first coordinates of s; j and s;+1,;, then 
the first coordinates of the sets selected from the second sets of the rows, 
Shi sj—2 and S4;44;-2, are (i +1)N +ai j and (i+ 1)N + ai41,j, respectively, 
and the former is equal to the latter. One can show in a similar way that 
there is no conflict between the third sets of the rows, and that there is no 
conflict between the last column of the gadget representing S;,; and the first 
column of the gadget representing Si, j+1- 
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For the reverse direction, suppose that sj; for 1 < i,j < k’ is a solution 
of the constructed GRID TILING WITH < instance. Consider the 12 outer 
sets in the gadget representing S;,;. The 12 pairs selected in the solution 
from these sets define 12 values z; let z4;-3,4;-3, etc., be these 12 values. We 
claim that all these 12 values are the same. The second coordinate of the 
set selected from $4; 3 4;—-3 is JN + Z4i—3,4j—3, the second coordinate of the 
set selected from $4; 34; is JN + 24i-3,4j-2; hence the definition of GRID 
TILING WITH < implies Z4i—3,4j—3 < 24; 3,4j—2- With similar reasoning, by 


going around the outer sets, we get the following inequalities. 








Z4i—3,4j—3 Š Z4i—3,4j—-2 [Š Z4i—3,4j-1 Š 2ai—-3,43 (first row) 
Z4i—3,4j S Z4i—2,4j < Z4i—1,4j < Z4i,4j (last column) 

—Z4i4j—-3 S —Z4i4j—-2 Š —Z4i4j-1 Š —24i,4j7 (last row) 
—Z4i—3,4j—3 Í —Z4i—2,4j—3 Í —Z4i—1,4j—-3 ŠÍ —2ai,aj—3 (first column) 


Putting everything together, we get a cycle of inequalities showing that these 
12 values are all equal; let z’? be this common value and let s; j = (a; j, bij) 
be the corresponding pair, that is, L(a; j, bij) = 2*1. The fact that z’ was 
defined using the pairs appearing in the gadget of S; j implies that sij € Sij- 
Let us show now that aij = ai+1,;. The pair selected from S4; 4; has first 
coordinate (i + 1)N + aij, while the pair selected from $4(;41) 3 4;-2 = 
S4i+1,4j—2 has first coordinate (i + 1)N + ai41,j- The definition of Grip 
TILING WITH < implies now that ai; < aj41,; holds. Similarly, comparing 
the first coordinates of the pairs selected from $4; 4;_; and $4,414; Shows 
—ai,j < —ai+1,j; hence aij = a;41,; follows. A similar argument, by looking 
at the last column of the gadget representing S;,; and the first column of the 
gadget representing S; j+1 shows bi ; = bi j+1- Therefore, we have proved that 
the constructed s;,;’s indeed form a solution of the Grip TILING instance. 














SCATTERED SET is a generalization of INDEPENDENT SET: given a graph 
G and integers k and d, the task is to find a set S of k vertices in G such that 
the distance between any two distinct u,v € S is at least d. Clearly, INDEPEN- 
DENT SET is the special case d = 2, hence there is no f(k)n°“)-time algorithm 
for SCATTERED SET on general graphs, unless ETH fails (Corollary (14.23). 
On planar graphs, we have seen that the special case INDEPENDENT SET is 
FPT (parameterized by k) and has subexponential parameterized algorithms. 
Moreover, we can show that SCATTERED SET remains FPT on planar graphs 
for any fixed d, or with combined parameters k and d (Exercise [7.41). If dis 
part of the input, we can do somewhat better than finding a solution by an 
n©(*) time brute-force algorithm. 


Theorem 14.31 ([355]). SCATTERED SET on planar graphs can be solved 
O(VR) | 


in time n 
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We show now that the algorithm of Theorem|14.31]is optimal with respect 
to the dependence of the exponent on k. 


Theorem 14.32. SCATTERED SET on planar graphs is W[1]-hard when pa- 


rameterized by k only. Moreover, it has no f(k)n\*) -time algorithm for any 
computable function f, unless ETH fails. 


Proof. We present a parameterized reduction from GRID TILING WITH <. Let 
(n,k,S) be an instance of GRID TILING WITH <; without loss of generality 
assume that k > 2. We represent each S;; with an n x n grid R; ș; let yet 
be the vertex of this grid in row a and column b. (Note that the distance 
between the first and last columns or rows of an n x n grid is n — 1.) Let 
L = 100n and d = 4L +n — 1. We construct a graph G” in the following way 


(see Fig. (14.6). 


(i) For every 1<i,j < k and (a, b) € Sij, we introduce a vertex m0, and 
connect it to the vertex v^, with a path of length L. 
(ii) For every 1 < i < k, 1 < j < k, we introduce a new vertex a;,; and 


F ica 
connect a; ; to vid, and vt" 
L. 


(iii) For every 1 < i < k, 1 < j < k, we introduce a new vertex yi, and 
connect yi j to vi, and vith? for every b € [n] with a path of length L. 


for every a € [n] with a path of length 


Note that the resulting graph is planar, see Fig. 


We may assume that the solution selects exactly one vertex U from 
each grid R; j- The locations of these vertices correspond to a solution 
of the GRID TILING WITH < instance. The distance constraint of SCAT- 
TERED SET ensures that the row number of the selected vertex in R;,; is 
at most the row number of the selected vertex in Rj+41,;, and the column 
number of the selected vertex in R; j is at most the column number of 
the selected vertex in Rj j+1- 


Suppose that sij E Sij is a solution for the Grip TILING WITH < 
instance; let s;; = (ai,;,0;,;). For every 1 < i,j < k, let Z contain 
ij 


Qi j, 


bes? We claim that vertices in Z are at distance at least d from 
each other. Notice first that two adjacent grids have distance at least 2L from 
each other: we have to go through at least two paths of length L to go from 
Ri j to Rj41,; for example. Therefore, if |i — i'| + |j — "| > 2, then every path 
between z; j and zi j; has to go through at least 2-2-+2 = 6 paths of length 
L (the extra 2 comes from the 2 paths with endpoints z;,; and z; j’), hence 
their distance is at least 6L > d. Consider now the vertices z; j and 24+1,;. If 
a path P connecting these two vertices has length less than 6L, then P has 
to go through yi j- The distance between z; j and yi j is 2L +n — aij < 3L, 
which is realized by a path using two paths of length L and a vertical subpath 
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Fig. 14.6: Theorem reducing the instance of GRID TILING WITH < 
in Fig. [14.4] with k = 3 and n = 5 to an instance of SCATTERED SET. The 
red edges represent paths of length L, the circled vertices correspond to the 
solution shown in red in Fig. [14.4] 


of the grid. Note that the shortest path cannot leave the grid and return to 
it, as then it would use at least four paths of length L. Similarly, we can 
observe that the distance between z;41,; and y; j is 20 + a;41,; — 1. Thus the 
distance between Zi j and Zi+1,j is 4L+n-—14 (Gi41,5 aij) > 4L+n-1= d, 
since sij and s;41,; satisfy the constraint that the first coordinate of s; j is 
at most the first coordinate of s;,1,;. A similar argument (using the fact that 
the second coordinate of s; j is at most the second coordinate of s; j+1) shows 
that the distance between z; j and z,;41 is at least. d. Therefore, the distance 
between any two vertices in Z is at least d. 
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For the reverse direction, suppose that Z is a set of k? vertices with pair- 
wise distance at least d. Let X; j be the set of all vertices that are at distance 
at most L < d/2 from some vertex of R; j. Observe that the distance between 
any two vertices in X; j is less than d, hence X; j can contain at most one 
vertex of Z. Furthermore, observe that the union of the X;,;’s cover the entire 
graph. Therefore, each of the k? vertices of Z appear in at least one of these 
sets, which is only possible if each X; ; contains exactly one vertex of Z and 
each vertex of Z appears in only one X; j. In particular, this means that the 
vertices a; j, Yi,j Of G, which are contained in two of these sets, cannot be 
part of the solution Z. We claim that no vertex of the grid R; j is selected. As 
k > 2, grid Ri j has a neighbor Ry j, that is, |i — i'| + |j — j'| = 1. Now every 
vertex of Xy j is at distance at most n—1+2L+n-— 1+ L< d from every 
vertex of R; j- This means that if a vertex of R; ; is in Z, then Xy y is disjoint 
from Z, contradicting our observation above. A similar argument shows that 
a vertex of X; j; N Z cannot lie on any path connecting a side of R; j with one 
of the neighboring vertices Qij, i,j—1y Vij Vi-1,3- Hence, the single vertex of 
Xi jNOZ has to be on some v; ie ‘bey =w zbi; Path for some (aij, bij) E Sij — 
otherwise it would be too close to an adjacent grid. We may actually assume 
that this vertex is the degree-1 vertex wg bd jibig At the end of the path: moving 
the vertex closer to the end of the path dogs not make it any closer to the other 
vertices of Z. Let zij = Weds be the vertex of Z contained in X; j, and 
let s; j = (a; j, bij). We claim that the pairs s; j form a solution for the GRID 
TILING instance. First, si j = (ai, j, bij) € Si,j follows from the way the graph 
is defined. To see that aij < aj41,;, let us compute the distance between zi, j 
aes and Ving is Ln- Qi js the 


and yi, j is L+ai+1,j— 1, hence the dis- 


and zi+1,j- The distance between z; j = w 
i+1,j 

ai+1,jbi+1,j 
tance between Zi j and Zi+1,j is 4L+n-—-14 (ai+i,j aij) = d+ (ai+i,j — Gi,;). 
As we know that this distance is at least d, the inequality aij < aj41,; fol- 
lows. By a oe ae computing the distance between z; j and 2,541 
shows that bi; < bi j+1 holds. Thus the pairs s;,; indeed form a solution of 
the GRID a WITH < instance. 


distance between zi+1, j = W 














We finish this section with a geometric problem. Given a set of disks of 
unit diameter in the plane (described by the coordinates of their centers 
and an integer k, UNIT DISK INDEPENDENT SET asks for a subset S of k 
disks that are pairwise disjoint. In other words, we need to select k centers 
such that any two of them are at distance at least 1 from each other. It is 
possible to solve the problem more efficiently than the nOC\*)-time brute-force 
algorithm. 





Theorem 14.33 ([9]). UNIT Disk INDEPENDENT SET can be solved in time 
nOw®), 


We show that the exponent O(Vk) in Theorem |14.33]is best possible by a 
reduction from GRID TILING WITH <. In the proof, we face a minor notational 
difficulty. When defining the GRID TILING problem, we imagined the sets S; j 
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arranged in a matrix, with S; j being in row 7 and column j. When reducing 
GRID TILING to a geometric problem, the natural idea is to represent S; j 
with a gadget located around coordinate (i, j). However, this introduces an 
unnatural 90 degrees rotation compared to the layout of the S;,;’s in the 
matrix, which can be confusing in the presentation of a reduction. Therefore, 
for geometric problems, it is convenient to imagine that S; j is “located” at 
coordinate (i, 7). To emphasize this interpretation, we use the notation S/z, y] 
to refer to the sets; we imagine that Sx, y] is at location (x,y), hence sets 
with the same x are on a vertical line and sets with the same y are on the 
same horizontal line (see Fig. (14.7). The constraints of GRID TILING are the 
same as before: the pairs selected from S[x,y] and S[x + 1,y] agree in the 
first coordinate, while the pairs selected from S[x,y] and S[x,y + 1] agree in 
the second coordinate. GRID TILING WITH < is defined similarly. With this 
notation, we can give a very clean and transparent reduction to UNIT DISK 
INDEPENDENT SET. 


Theorem 14.34. UNIT DISK INDEPENDENT SET is W[1]-hard and, unless 
ETH fails, it has no f(k)nv*) -time algorithm for any computable function 
f. 

Proof. It will be convenient to work with open disks of radius 5 (that is, 
diameter 1) in this proof: then two disks are nonintersecting if and only if 
the distance between their centers is at least 1. The proof is by reduction 
from GRID TILING WITH <. Let I = (n, k, S) be an instance of GRID TILING 
WITH <. We construct a set D of unit disks such that D contains a set of k? 
pairwise nonintersecting disks if and only if J is a yes-instance. 

Let € = 1/n?. For every 1 < x,y < k and every (a,b) € S[x,y] € [n] x [n], 
we introduce into D an open disk of radius 5 centered at (x + £a, y +b); let 
D{zx, y] be the set of these |S{x, y]| disks introduced for a fixed x and y (see 
Fig. (14.7). Note that the disks in D[x, y] all intersect each other. Therefore, 
if D' C D is a set of pairwise nonintersecting disks, then |D’| < k? and 
|D'| = k? is possible only if D’ contains exactly one disk from each D|z, y]. In 
the following, we prove that there is such a set of k? pairwise nonintersecting 
disks if and only if I is a yes-instance. 

We need the following observation first. Consider two disks centered at 
(x+ea,y+eb) and (x +1+ea', y +eb') for some (a,b), (a’, b’) € [n] x [n]. We 
claim that they are nonintersecting if and only if a < a’. Indeed, if a > a’, 
then the square of the distance of the two centers is 





(1 +ela' —a))? + — b)? < (1+e(a' —a))? + €?n? 
<(l-e+e=l-e+e’<1 


(in the first inequality, we have used b’,b < n; in the second inequality, we 
have used a > a’ + 1 and £ = 1/n?). On the other hand, if a < a’, then the 
square of the distance is at least (1 + ¢(a’ — a))? > 1, hence the two disks do 
not intersect (recall that the disks are open). This proves our claim. A similar 
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S[1, 3}: S[2, 3]: S13, 3]: 
(1,1 (3,2) (5,4) 
(2,5 (2,3) (3,4) 
(3,3 

S/1, 2]: S[2, 2]: sS[3, 2]: 
(5,1 (3,1) (1,1) 
(1,4 (2,2) (2,3) 
(5,3 

S/1, 1): S[2, 1): S[3, 1): 
(1,1 (2,2) (1,3) 
(3,1 (1,4) (2,3) 
(2.4 (3,3) 























Fig. 14.7: An instance of GRID TILING WITH < with k = 3 and n = 5 and the 
corresponding instance of UNIT DISK INDEPENDENT SET constructed in the 
reduction of Theorem [14.34] The small dots show the potential positions for 
the centers, the large dots are the actual centers in the constructed instance. 
The shaded disks with red centers correspond to the solution of GRID TILING 
WITH < shown on the left 


claim shows that disks centered at («+ea,y+eb) and (a+ea’,y+1+eb’) are 
nonintersecting if and only if b < b’. Moreover, it is easy to see that the disks 
centered at (a+ ea,y+eb) and (2’ + <a’, y’ +b’) for some 1 < a,a’,b,b' < n 
cannot intersect if |x — «| + |y — y'| > 2: the square of the distance between 
the two centers is at least 2(1 — en)? > 1. 

Let the pairs s|x, y] € S[zx, y] form a solution of the instance J; let s|x, y] = 
(ala, y], b[z, y]). For every 1 < x,y < k, we select the disk d[x, y] centered at 
(x +ea[x, y], y+ eblx, y]) € D(x, y]. As we have seen, if |x — z'| + |y — y'| > 2, 
then d|x, y] and d[z’,y’] cannot intersect. As the s[z,y]’s form a solution of 
instance I, we have that alx,y] < ala + 1,y]. Therefore, by our claim in 
the previous paragraph, the disks d[z,y] and d{a + 1,y] do not intersect. 
Similarly, we have b[z, y] < bla, y+ 1], implying that d[x, y] and d[x, y+ 1] do 
not intersect either. Hence there is indeed a set of k? pairwise nonintersecting 
disks in D. 

Conversely, let D’ C D be a set of k? pairwise independent disks. This is 
only possible if for every 1 < x,y < k, the set D’ contains a disk d[x,y] € 
Diz, y], that is, centered at («+<ealx, y], j7+eb[x, y]) for some (als, y], bla, y]) € 
[n] x [n]. We claim that the pairs s[x, y] = (ala, y], b[x, y]) form a solution of 
the instance I. First, d[x,y] € D[x,y] implies that s[xz,y] € S[a,y]. As we 
have seen above, the fact that d[x, y] and d[x + 1, y] do not intersect implies 
that a[z,y] < aļxz + 1,y]. Similarly, the fact that d|x, y] and d[a, y + 1] do not 
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intersect each other implies that b[x, y] < b[z, y + 1]. Thus the s[z, y]’s form 
a solution for the GRID TILING WITH < instance I. 














*14.5 Lower bounds based on the Strong 
Exponential-Time Hypothesis 


In essence, ETH enables us to estimate the optimal magnitude of the para- 
metric dependency of the running time of parameterized algorithms. SETH is 
a more precise tool, using which one can pinpoint almost exactly the asymp- 
totic behavior of the optimal running time, at a cost of adopting a much 
stronger complexity assumption. For instance, suppose we are working with 
a parameterized problem admitting an O*(3") algorithm. A typical lower 
bound under ETH is of the form “No algorithm with running time O*(2°”)) 
can be expected”. Under SETH one can prove statements like “For any € > 0, 
there is no algorithm with running time O*((3 — ¢)*)”, thus showing that 
even the constant 3 in the base of the exponent is hard to improve. 

Observe that for any constant c, saying that there exists an algorithm with 
running time O*((c — €)*) for some € > 0 is equivalent to saying that there 
exists an algorithm with running time O*(c>*) for some 6 < 1. While the first 
notation gives a more intuitive feeling about what the lower bound actually 
means, the second is more convenient when it comes to formal proofs. We 
will hence use both notations, jumping between them implicitly. 

Lower bounds under SETH can be proved by providing reductions from 
variants of CNF-SAT, similarly to the lower bounds under ETH. However, 
they are typically much more delicate. For ETH we are usually interested 
only in the asymptotics of how the parameter is transformed — it is relevant 
whether the output parameter is linear or quadratic in the input one, but the 
precise leading coefficient of the linear function does not play any role. For 
SETH it is precisely this coefficient that is crucial. To see it on an example, 
say that we provide two reductions from CNF-SAT to some parameterized 
problem L, where in the first reduction the output parameter k is equal to 2n 
(here, n is the number of variables), and in the second reduction the output 
parameter is equal to n. Then the first reduction shows only that, for any 
€ > 0, there is no algorithm with running time O*((/2 — ¢)*) assuming 
SETH, while the second one improves this lower bound to nonexistence of an 
O*((2 — £)¥) algorithm. 

Interestingly, observe that in the presented example we could state the 
lower bound under a slightly weaker assumption that the general CNF-SAT 
problem cannot be solved in time O*((2—<)”) for some € > 0. This is a com- 
mon phenomenon in many lower bounds under SETH, where the reduction 
can be assumed to start from general CNF-SAT, rather than from q-SAT 
for increasing, yet constant values of g. However, there are examples where 
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it is helpful to use the full power of SETH; we will see one such lower bound 
in the next section. 

Before we proceed, let us comment that the problems for which SETH 
is known to give a tight understanding of their (parameterized) complexity 
are much scarcer than the problems affected by ETH. The difficulty lies in 
the very specific requirements that reductions proving lower bounds under 
SETH have to satisfy. Most known reductions from CNF-SAT or its variants 
encode the clauses of the input formula using some kind of gadgets. Therefore, 
in most cases it is hard to avoid the situation where the output parameter 
(or the total output size, if we are interested in the classical complexity of 
the problem) depends on the number of clauses instead of just variables; 
this is completely unacceptable when trying to prove a lower bound based on 
SETH. This applies in particular to problems parameterized by the requested 
size of the solution, like VERTEX COVER or FEEDBACK VERTEX SET. In 
fact, basically all the known lower bounds under SETH for FPT problems 
concern structural parameterizations, such as the size of the universe of the 
instance (say, in the HITTING SET problem) or some width parameter of the 
input graph; such parameters are often much easier to control in terms of the 
number of variables. 

Hence, one could argue that we are missing some important theoretical 
tool for SETH, similar to the sparsification lemma, which would enable us 
to unlock the full potential of this conjecture. Alternatively, this state of the 
art might indicate that SETH is simply too weak an assumption for proving 
precise statements about the running times of exponential-time algorithms, 
and we need stronger and better-suited working conjectures. 

In the following three sections we present exemplary lower bounds that can 
be proved under the assumption of SETH. Actually, these examples almost 
cover all the significant applications of SETH in parameterized complexity 
that are known so far, as explained in the previous paragraphs. 


14.5.1 HITTING SET parameterized by the size of the 
universe 


Recall that the HITTING SET problem can be solved by a brute-force search in 
time O*(2"), where n is the size of the universe/*| It is natural to ask whether 
the constant 2 in the base of the exponent could be improved, or, in other 
words, whether one can search through significantly less than 2” possible 
candidates for the solution. It appears that this is not the case under the 
assumption of SETH — the barrier of brute-force 2” enumeration is hard to 
break. 





6 We recall that the O*(-) notation hides factors polynomial in the total input size, i.e., 
here polynomial in |U| + |F|, and not just polynomial in n = |U]. 
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The first natural idea for a reduction from CNF-SAT would be the follow- 
ing. Create two elements of the universe tz, fy for each variable x, and add a 
set containing this pair to the constructed family. Set the budget for the size 
of the hitting set to be equal to n, so that in each of these two-element sets 
exactly one of the elements must be chosen. Choosing t, corresponds to set- 
ting the variable to true, and choosing f, corresponds to setting it to false, so 
we have one-to-one correspondence between assignments of the variables and 
candidate solutions. Now satisfaction of clauses can be encoded directly as 
hitting sets: for instance for a clause xV ~y V z we will create a set {tz, fy, tz}, 
which is hit if and only if the clause is satisfied. 

Unfortunately, this reduction gives only an O*((V/2 — €)") lower bound, 
since the universe in the output instance is of size 2n. The problem is about 
redundancy: if we know that some t, is chosen to the solution, then we are 
sure that fy is not chosen, and thus we did not exploit the whole 2?” search 
space of the output instance, but rather a subspace of cardinality 2”. We 
hence need a smarter way of packing the information. 

Suppose that the reduction we are going to construct will create a universe 
U of size n’, which is slightly larger than n; for now think that n! = n + 
O(log n). Suppose further that k is the intended size of the hitting set. Now 
observe that k must be very close to n’/2 for the following reason. If k < 
0.49 -n’ or k > 0.51 - n’ for some n’, then the solution might be found 
by brute-force verification of all the sets of (ap) or eee and there 
are only O((2 — ¢)") such sets for some £ > 0. Therefore, the intuition is 
that the 2” search space of CNF-SAT has to be embedded roughly into 
(ara); since Gara) EO (25) by Stirling approximation, the additional 
additive O(log n) factor in n’ should be sufficient to ensure enough space to 
accommodate all possible variable assignments. 

It seems, however, challenging to implement such an embedding so that the 
clauses can be also conveniently checked. Now is the point when it becomes 
useful to start with g-SAT instead of general CNF-SAT. Let us arbitrarily 
partition the variable set into large, but constant-size groups. Let p be the 
size of the group. Each group X will have a corresponding part of the universe 
Ux of size p' = p+ O(log p), and the space of assignments of variables of X 
will be embedded roughly into a) which has size Cy) > 2?. Thus we 
have n’ = g - n, and when p grows to infinity then n’ becomes bounded by 
(1+ a)n, for any a > 0. Finally, provided that we start with g-SAT instead 
of general CNF-SAT, every clause touches at most q groups of size p each, 
and hence it can be modelled using a constant (yet exponential in q and p) 
number of sets in the family. 

We now proceed to a formal explanation. 





Theorem 14.35. Unless SETH fails, there is no algorithm for HITTING SET 
that achieves running time O*((2—«)") for any £ > 0, where n is the size of 
the universe. 
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Proof. We shall present a reduction that for any constants a > 0 and q > 3, 
takes an instance of g-SAT on n variables and transforms it in time poly- 
nomial in the input size into an equivalent instance of HITTING SET on a 
universe of size at most (1 + a)n. We note that in this reduction we will 
treat q and a as constants, and thus both the size of the family of the output 
HITTING SET instance and the running time of the reduction can depend 
exponentially on q and 1/a. 

We first verify that the existence of such a reduction will provide us with 
the claimed lower bound. For the sake of contradiction, assume that HITTING 
SET indeed can be solved in time O*(2°") for some constant ô < 1. Fix some 
a > 0 so that a < 1 and 6’ := 6(1+ a) < 1. Assuming SETH, there exists 
a constant q such that q-SAT cannot be solved in time O*(2°”). Consider 
now the following algorithm for g-SAT: apply the presented reduction for 
parameters a and q, and then solve the resulting instance of HITTING SET 
using the assumed faster algorithm. Thus we could solve q-SAT in time 
O* (29(1+a)n) — O* (2°), which contradicts our choice of q. 

We now proceed to the reduction itself. Let y be the input instance of 
q-SAT, let Vars be the variable set of p, let Cls be the clause set of p, 
and let n = |Vars|, m = |Cls|. We fix some odd constant p > 3 such that 
p + 2flogsp] < (1+ a/3)p. Let p' = p+ 2[log, p|; note that p’ is also odd. 
Without loss of generality we will assume that n is divisible by p. This can 
be achieved by adding at most p— 1 dummy variables not appearing in any 
clause, which increases the size of the variable set by multiplicative factor at 
most 1 + a/3, unless the input instance is of constant size and we may solve 
it by brute-force. 

Partition Vars into n/p groups of variables Vars,, Vars2,...,VarSn/p, 
each of size p. For each group Vars; we create a set of elements U; of size p’. 


Let At = Gea) and Al = (oe) Note that |A!| = |.At| > iy > fo + QP. 
2 2 


It can be readily checked that for p > 3 it holds that p? > 1+ p’, so | Ay = 
IAT] > 2P, Therefore, let us fix an arbitrary injective mapping 7; from the set 
Varsi? of all the true/false assignments of the variables of Vars; to the 
set Ab. Let B; = A‘ \ Im(n;), where Im(n;) denotes the image of ni. 

Take any clause C € Cls and let Varsj,.,,Varsjo.,---,VarSjo,,,, be all 
the variable groups that contain any variable appearing in C. Since C has at 
most q literals, we have that rc < q. Hence the set Varsc := U}S, Varsjc, 
has cardinality at most p’g, which is a constant. Now consider all the 
true/false assignments of the variables of Varsc, and let Zc be the set of 
those assignments for which the clause C is not satisfied. For each ¢ € Zo 
construct a set FE by taking 


ro 
FE T U Ncw (Clvarsio s) 


t=1 
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Now we construct the universe of the output HITTING SET instance by tak- 
ing U = ye U;, while the family F is defined as follows. The family F 


comprises: 


e all the sets contained in Al, for each i = 1,2,...,n/p; 
e all the sets contained in B;, for each i = 1,2,...,n/p; and 
e all the sets FE for C € Cls and ¢ € Zo. 
Finally, we set budget k = n/p- pL and output the instance (U, F, k) of 
HITTING SET. Note that |U| < (1 + a/3)n, which, together with the pre- 
liminary blow-up of the universe size due to introducing dummy variables, 
gives a total increase of the universe size by a multiplicative factor at most 
(1+a/3)? <1+a (asa <1). 

We now argue equivalence of the instances. Assume first that p admits a 
satisfying assignment Y: Vars + {1,7}. Construct a set X by taking 


n/p 


X = Ui \ nWhvers,)s 


i=1 


and observe that |X| = k. We claim that X is a solution to the HITTING 
SET instance (U,F,k). For any i € [n/p] and any F € A! we have that 
|F| =|XNU,| = pee and |U;| = p', so X N F must be nonempty. Moreover, 
we have that U; \ X € Im(n;) by the definition of X, so X N F Æ 0 for any 
Fe Bi. 

Now consider any C € Cls and any Ç € Zc. Since C is satisfied by the 
assignment w, there exists a variable x such that x belongs to some group 
Varsj,, and a literal of x satisfies C. Let j = jo,r. Since Ç does not satisfy C, 
we in particular have that |vars,; Æ ¢|lvars;. Hence FE NU; = n; (Clvars;) # 
nj(Hlvars,) = U; \ X. Since |X n U;| = 24, |FENU;| = 54, |U;| =p’ and 
FE NU; # U; \ X, we have that FE NX OU; £0 and we are done with the 
first implication. 

Conversely, assume that there exists a set X C U, |X| < k, that has a 
nonempty intersection with every set of F. First observe that |XNU;| > = 
for each i € [n/p], since otherwise |U; \ X| > prr and X would not hit any 


subset of U; \ X of size PEL, but At C F. Since |X| < k = n/p- ptl and 
sets X N U; are pairwise disjoint, we infer that |X| = k and |X N U;| = pL 
for each i € [n/p]. Moreover, since X N F # @ for each F € B;, we infer 
that U; \ X € Im(nņ:). For each i let us construct an assignment %;: Varsi > 
{L, T} by taking y; = n7 (U:\ X). We claim that ~ = ue qh; is a satisfying 
assignment for y. 

Take any clause C € Cls, and assume for the sake of contradiction that 
w does not satisfy C. Let C = W|varsg; then we have that ¢ € Zo by the 
definition of Zc. Since X was a solution, we have that X N FE # Ø; hence 


XA FE N U; #0 for some j = jc. By the definition of FE we have that 
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FE NU; = n; (Clvars;) = n; (YV|vars;)- On the other hand, by the definition of 

w we have that U; \ X = ;(Wlvars,). Consequently U; \ X = FF N U}, and 
j Nj j j ¢ j 

so X N FE NU; = Í, a contradiction. 














Observe that the proof of Theorem [14.35|substantially depends on the fact 
that the starting point is an instance of q-SAT for a constant q, instead of 
arbitrary CNF-SAT. This is used to be able to construct the families Zo: if 
a clause could be arbitrarily large, then the size of the family Zc would be 
exponential in n, and the reduction would use exponential time. Hence, the 
lower bound of Theorem [14.35] holds only under the assumption of SETH, 
and not under the weaker assumption that CNF-SAT cannot be solved in 
time O*((2 — £)” ) for any € > 0. 

Lower bounds of a similar flavor to Theorem[14.35|can be proved for some 
other problems; examples include SET SPLITTING parameterized by the size 
of the universe, or NAE-SAT parameterized by the number of variables (see 
Exercise [14.16). Both these problems do not admit O*((2 — ¢)") algorithms 
for any € > 0, unless SETH fails. 

Recall, however, that for the HITTING SET problem we have also a 
dynamic-programming algorithm working in time O*(2™), where m is the 
size of the family; equivalently, SET COVER can be solved in time O*(2"), 
where n is the size of the universe (see Theorem (6.1). Is the constant 2 in 
the base of the exponent optimal also in this case? Actually, we neither have 
a faster algorithm, nor are able to link non-existence of such to SETH. This 
motivates the following conjecture; g-SET COVER is the SET COVER problem 
with an additional assumption that every F € F has cardinality at most q. 


Conjecture 14.36 (Set Cover Conjecture, [112]). Let Aq be the infini- 
mum of the set of constants c such that q-SET COVER can be solved in time 
O*(2°"), where n is the size of the universe. Then limg-... Ag = 1. In partic- 
ular, there is no algorithm for the general SET COVER problem that runs in 
O*((2 — £)”) for any € > 0. 


It appears that several essentially tight lower bounds for classic parameter- 
ized problems can be established under the Set Cover Conjecture. Therefore, 
the question of finding links between SETH and the Set Cover Conjecture 
seems like a natural research direction. 


Theorem 14.37 ( ). Unless the Set Cover Conjecture fails, 


o STEINER TREE cannot be solved in time O*((2—e)*) for any e > 0, where 
£L is the target size of the tree; 

e CONNECTED VERTEX COVER cannot be solved in time O*((2—¢)*) for 
any £ > 0, where k is the target size of the solution; 

e SUBSET SUM cannot be solved in time O*((2 — £)™) for any £ > 0, where 
m is the number of bits of the encoding of the target sum t. 
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14.5.2 Dynamic programming on treewidth 


Recall that in Chapter|7|we have learned how to design dynamic-programming 
routines working on a tree decomposition of a graph. For many classic prob- 
lems with simple formulations, the straightforward dynamic program has 
running time O*(c’) for some typically small constant c. For instance, for 
INDEPENDENT SET one can obtain running time O* (2t), while for DOMINAT- 
ING SET it is O*(3"). The intuitive feeling is that this running time should be 
optimal, since the natural space of states exactly corresponds to the informa- 
tion about a subtree of the tree decomposition that needs to be remembered 
for future computations. In other words, every state is meaningful on its own, 
and can lead to an optimal solution at the end. It is natural to ask whether 
under the assumption of SETH we can support this intuition by a formal 
lower bound. 

This is indeed the case, and we can use a similar approach as that for 
Theorem (we sketched the main ideas behind its proof in the discussion 
after the statement). In this section we will discuss in detail the case of the 
INDEPENDENT SET problem parameterized by pathwidth, which turns out 
not to admit an O*((2 — ¢)?) algorithm for any € > 0, assuming SETH. 
Before we proceed with a formal argumentation, let us discuss the intuition 
behind the approach. Note that a lower bound for pathwidth is even stronger 
than for treewidth. 

We start the reduction from an arbitrary CNF-SAT instance on n vari- 
ables and m clauses. The idea is to create a bundle of n very long paths 
P!, P?,...,P” of even length, corresponding to variables 271, %2,..., £n. AS- 
sume for now that on each of these paths the solution is allowed to make 
one of two choices: incorporate into the independent set either all the odd- 
indexed vertices, or all the even-indexed vertices. Then for every clause we 
construct a clause verification gadget and attach it to some place of the bun- 
dle. The gadget is adjacent to paths corresponding to variables appearing 
in the clause, and the attachment points reflect whether the variable’s ap- 
pearance is positive or negative (glance at Fig. [14.9] for an idea of how this 
will be technically carried out). As usual, the role of the clause gadget is to 
verify that the clause is satisfied. Satisfaction of the clause corresponds to 
the condition that at least one of the attachment points of the clause gadget 
needs to be not chosen into the constructed independent set; hence the clause 
gadget needs to have the following property: the behavior inside the gadget 
can be set optimally if and only if at least one of the attachment points is 
free. Fortunately, it is possible to construct a gadget with exactly this prop- 
erty, and moreover the gadget has constant pathwidth, so it does not increase 
much the width of the whole construction. See Fig. [14.8] and Claim [14.39] for 
the implementation. 

One technical problem that we still need to overcome is the first technical 
assumption about the choices the solution makes on the paths P*. It is namely 
not true that on a path of even length there are only two maximum-size 
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K1 K2 K3 K4 K5 K6 


Fig. 14.8: Gadget Mg with maximum independent set Z4 (i.e., the one with 
Z4 OT = {k4}) highlighted 


independent sets: the odd-indexed vertices and the even-indexed vertices. The 
solution can first start with picking only odd-indexed vertices, then make a 
gap of two vertices, and continue further with even-indexed vertices. Thus, on 
each path there can be one “cheat” where the solution flips from odd indices 
to even indices. The solution to this problem is a remarkably simple trick 
that is commonly used in similar reductions. We namely repeat the whole 
sequence of clause gadgets n + 1 times, which ensures that at most n copies 
are spoiled by possible cheats, and hence at least one of the copies is attached 
to area where no cheat happens, and hence the behavior of the solution on 
the paths P* correctly encodes some satisfying assignment of the variable set. 
We now proceed to the formal argumentation. 


Theorem 14.38. Suppose there exists a constant € > 0 and an algorithm 
that, given an integer k and a graph G with its path decomposition of width 
p, checks whether G admits an independent set of size k in time O*((2—e)?). 
Then CNF-SAT can be solved in time O* ((2—¢€)”), and in particular SETH 
fails. 


Proof. We provide a polynomial-time reduction that takes a CNF-SAT in- 
stance y on n variables, and constructs an instance (G, k) of INDEPENDENT 
SET together with a path decomposition of G of width n + O(1). Provided 
that the supposed faster algorithm for INDEPENDENT SET existed, we could 
compose the reduction with this algorithm and solve CNF-SAT in time 
O*((2 — €)”). 

We first make an auxiliary gadget construction that will be useful in the 
reduction. Let M, be the graph depicted in Fig. [I4.8] consisting of a sequence 
of r triangles with each two consecutive ones connected by two edges, and two 
pendant vertices attached to the first and the last triangle. By K1, k2,..., Kr 
we denote the degree-2 vertices in the triangles, as shown on Fig. and 
we also denote T = {k1, k2,...,K,}. The following claim, whose proof is left 
to the reader as Exercise [14.14] summarizes all the properties of M, that we 
will need. 


Claim 14.39. Assume r is even. Then the maximum size of an independent 
set in My is equal to r+ 2, and each independent set of this size has a 
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nonempty intersection with T. Moreover, for every a =1,2,...,r there exists 
an independent set Za in M, such that |Za| = r+2 and Za QT = {ka}. 
Finally, the graph M, has pathwidth at most 3. 


Thus, graph M, can be thought of as a gadget encoding a single 1-in-r 
choice. A similar behavior could be modelled by taking M, to be a clique on 
r vertices, but the crucial property is that M, has constant pathwidth. In 
the reduction we will use multiple copies of M, for various values of r. For a 
copy H of M,, by ka( H), Z,(H), and Z(H) we denote the respective objects 
in H. 

We now proceed to the construction itself. Let Vars = {£1, £2,..., En} 
Cls = {C\,C2,...,Cm} be the variable and the clause set of p, respectively. 
For C € Cls, by |C| we will denote the number of literals appearing in 
C, and let us enumerate them arbitrary as (9, 09,... Lo) By duplicating 
some literals if necessary we will assume that |C| is even for each C € Cls. 
Without loss of generality we assume that no clause of Cls contains two 
opposite literals of the same variable, since such clauses are satisfied in every 
assignment. 

We first construct an auxiliary graph Gp as follows. Introduce n paths 
Pt, P?,...,P”, each having 2m vertices. Let pp, p{,---;Pam—1 be the vertices 
of P’ in the order of their appearance. For each clause C}, construct a graph 
Hj being a copy of Mic,- For every a = 1,2,...,|Cj|, let xi, be the variable 
appearing in literal Oi E 093 = Ti, then connect kq(H;) with Po and if 
i= =Ti, then connect kq(H;) with pr This concludes the construction 
of the graph Go. 

To construct the output graph G, we create n + 1 copies of the graph Go; 
denote them by Gj, G2, G3,..., Gn+1- We follow the same convention that 
an object from copy Ga, for a = 1,2,...,n+ 1, is denoted by putting Ga 
in brackets after the object’s name in Go. Connect the copies sequentially 
as follows: for each a = 1,2,...,n and each 2 = 1,2,...,n, connect vertices 
Pm-1(Ga) and pi (Ga+1). This concludes that construction of graph G. The 
following claim, whose proof is left to the reader as Exercise verifies 
that G indeed has low pathwidth. 


Claim 14.40. G admits a path decomposition of width at most n + 3, which 
furthermore can be computed in polynomial time. 


Finally, we set the expected size of the independent set to be k = 
(n+ 1)- (mn + X; ICs + 2). It now remains to argue that G admits 
an independent set of size k if and only if ọ is satisfiable. 

Assume first that y admits a satisfying assignment Y: Vars > {L,T}. 
We first construct an independent set Xo of size mn + X57 (|C;| + 2) in 
Go. For every i = 1,...,n, if w(a;) = L then in Xo we include all vertices 
pi with even b, and if ~(2;) = T then we include all vertices p? with odd b. 
For each clause C} fix an index a; such that the literal esi satisfies C} in the 


*14.5 Lower bounds based on the Strong Exponential-Time Hypothesis 


Or 
m 


1 





Pi pP? Ps pt P53 Pô 


p 


EG 


Gi 














TU 


G2 





Y 








ji 





jay 


Ahh A | A 


G3 









































Fig. 14.9: Constructions in the proof of Theorem Left side: Connections 
adjacent to a clause gadget H; for Cj = 71 Vaz V 724V x6, together with the 
constructed independent set for variable assignment w(r4) = Y(z6) = L and 
p(z) = w(a2) = (x3) = (a5) = T. Note that the behavior in the gadget 
H; has been set according to the choice of literal =~x4 as the literal satisfying 
Cj, but we could also choose literal z1. Right side: Arranging copies of Go 
into G 














assignment 7; let x;, be the variable appearing in this literal. Then include 
into Xo the independent set Za, (H;) given by Claim 

Clearly |Xo| = mnt yin (ICj |+2). We now verify that Xo is independent 
in Go. From the construction it directly follows that the vertices of X lying 
on paths P’ are pairwise nonadjacent, and also vertices lying in different 
gadgets H; are nonadjacent. Vertices of Xo lying in the same gadget H; 
are nonadjacent by Claim It remains to verify that no vertex of Xo 
belonging to a gadget H; can see any vertex of Xo lying on any path P’. The 
only vertices of H; adjacent to the vertices outside H; are vertices of Z(H;), 
and we have Xo NZ(H;) = {Ka,;(H;)}. The only neighbor of ka, (H;) outside 
H; is the vertex pj,» if oii = ti, or pij İf tei = —7;,. Since toi satisfies 
C; in the assignment y, it follows from the definition of Xo that precisely 
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this vertex on P*i is not chosen to Xo. So indeed Xo is an independent set 
in Go. 

Let X1, X2,...,Xn+1 be the images of Xo in the copies G1, G2,...,Gn+41, 
and let X = uzi Xi. Clearly |X| = k, and it follows easily from the con- 
struction that X is independent: for every edge p},,,_1(Ga)p)(Ga+1) con- 
necting each two consecutive copies, exactly one of the assertions holds: 
P2m—1(Ga) E Xa Or po(Ga+1) E Xa+1- 

We are left with the converse implication. For i = 1,2,...,n, let P = 
G[U2t} V (P!(Ga))]. Observe that P' is a path on (n + 1) - 2m vertices, and 
thus the maximum size of an independent set in P* is (n+1)m. Assume that 
G admits an independent set X of size k. We then have that |X n V(P*)| < 
(n + 1)m for each i = 1,2,...,n, and by Claim [14.39] we have that |X N 
V (Aj (Ga))| < |C;| +2 for each j = 1,2,..., mand a = 1,2,...,n + 1. Since 
the sum of these upper bounds through all the paths P’ and gadgets H;(Ga) 
is exactly equal to k, we infer that all these inequalities are in fact equalities: 
IXAV(PŻ)| = (n+1)m for each i = 1,2,...,n, and |XXAOV(H;(Ga))| = |C;|+2 
for each j = 1,2,...,m and a = 1,2,... n + 1. 

Examine now one path P’ and observe that any independent set of size 
(n+ 1)m in Ê’ can leave at most one pair of consecutive vertices on P* not 
chosen. We will say that the copy Ga is spoiled by the path P' if V(P?(G,))\X 
contains a pair of consecutive vertices on P’(G,). It then follows that each 
path P* can spoil at most one copy Ga, so there exists at least one copy 
Ga, that is not spoiled at all. Recall that Gao is a copy of Go; let Xo C 
V (Go) be the set of originals of the vertices of the set V (Gas) OX. We then 
have that Xo contains every second vertex on each path P’, beginning with 
pi and finishing on pj,,, 9, or beginning with på and finishing on p},,,_1. 
Moreover, Xo N V (4H) is a maximum-size independent set in each gadget H;. 
By Claim [14.39] it follows that for each 7 = 1,2,...,m there exists an index 
aj such that Ka, (H;) € Xo. 

Let Y: Vars — {L,T} be an assignment defined as follows: ẹy(z;) = 
Lif XG ITV (PY) = {p$ 0, .-.,Ém-2}, and le) = T if Xo N V(PŻ) = 
{p{,P3,--+,D>m—1}- Consider now any clause Cj, and let x;, be the variable 
appearing in the literal G. Recall that ka, (Hj) € Xo, and it is adjacent to 
P2 if i = 2;,, or to Poy if tii = -2;,. As Xo is independent, we infer 
that this vertex on P* does not belong to Xo. Hence o(a;,) = T if esi = T; 
and #(x;,) = L if ei = 72;,, by the definition of w. In both cases we can 
conclude that the literal i satisfies C} in the assignment p. Since C; was 
chosen arbitrarily, we infer that w satisfies y. 














Under SETH we can also prove the optimality of known dynamic-programming 
routines for many other problems. The following theorem provides a handful 
of the most important examples (note that the lower bound for q- COLORING 
is given only for tree decompositions and not path decompositions). 
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Theorem 14.41 ([113} [118] [325]). Assume that CNF-SAT cannot be 
solved in time O*((2—«')") for any €' > 0. Then for every £ > 0 the following 
holds (p/t is the width of a given path/tree decomposition of the input graph): 


DOMINATING SET cannot be solved in time O*((3 — €)”); 

ODD CYCLE TRANSVERSAL cannot be solved in time O*((3 — €)”); 
g-COLORING cannot be solved in time O* ((q—«)'), for any constant q > 3; 
STEINER TREE cannot be solved in time O*((3 — €)?); 

FEEDBACK VERTEX SET cannot be solved in time O*((3 —«)?); 
CONNECTED VERTEX COVER cannot be solved in time O*((3 — €)?); 
HAMILTONIAN CYCLE cannot be solved in time O*((2 + V2 —€)?). 


Note that for the last three points, naive dynamic-programming routines 
work in slightly super-exponential time, which can be improved to single ex- 
ponential using the techniques of Chapter [I1] In particular, the lower bounds 
given by Theorem [14.41] for STEINER TREE, FEEDBACK VERTEX SET, and 
CONNECTED VERTEX COVER match the randomized upper bounds given by 
the Cut & Count technique. Therefore, in these examples the bases of ex- 
ponents in running times given by Cut & Count are not just coincidental 
numbers stemming from the technique, but match inherent properties of the 
problems themselves. 

Observe also that all the lower bounds mentioned in Theorem prove 
that the optimal base of the exponent is a number larger than 2, in most 
cases 3. In the case of INDEPENDENT SET the reduction intuitively embed- 
ded the 2” search space into 2? states of the dynamic program, which was 
very convenient as we could use a one-to-one correspondence between vari- 
ables x; and paths P;. In the case of, say, DOMINATING SET, the reduction 
should embed 2” possible variable assignments into 3” states of the dynamic 
program, where n’ = n- log, 2. Hence, in the resulting decomposition we can- 
not have one element of a bag per each variable, but every element of a bag 
has to bear information about more than one variable; more precisely, about 
log, 3 ~ 1.585 variables. Needless to say, this creates technical difficulties in 
the proof. 

These difficulties can be overcome using a similar technique as in the proof 
of the lower bound for HITTING SET parameterized by the size of the universe 
(Theorem |14.35). We namely partition the variables into groups of size q for 


some large constant q, and assign q = ats | vertices of a bag to each 
2 


group. In this manner, for each group we can encode all 27 < 3 variable 
assignments as possible interactions of the solution with the assigned subset 
of the bag. Also, the pathwidth of the resulting graph will be approximately 
n: £ < RAE -(1+ a) for some a converging to zero as q grows to infinity. 
As in the case of Theorem it is somewhat problematic to extract an 
assignment of a particular clause from a group in order to check it against 


a clause gadget. This is done via an additional layer of group gadgets that 
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are responsible for decoding the assignment for a group; we refer the reader 


to [118] for details. 


14.5.3 A refined lower bound for DOMINATING SET 


In Section [14.4] we have seen that ETH can be used to provide estimates of 
the magnitude of the function of k appearing in the exponent of n for W[1]- 
hard problems. In particular, in Theorem [14.23] we have proved that, unless 
ETH fails, DOMINATING SET does not admit an algorithm with running time 
f(k) - n°) for any computable function f. Again, it appears that having 
assumed SETH we can establish even more precise lower bounds. This is 
exemplified in the following theorem. 


Theorem 14.42. Unless CNF-SAT can be solved in time O*((2 —«')”) for 
some €' > 0, there do not exist constants € > 0, k > 3 and an algorithm 
solving DOMINATING SET on instances with parameter equal to k that runs 
in time O(N*-*), where N is the number of vertices of the input graph. 


Proof. For the sake of contradiction assume that such constants ¢ > 0, k > 3 
and such an algorithm does exist. We are going to present an algorithm for 
CNE-SAT working in time O*(2°") for 6 =1—£ <1. 

Let y be the input instance of CNF-SAT, let Vars be the variable set 
of p, and let n,m be the numbers of variables and clauses of p, respec- 
tively. Partition Vars into k parts Vars, Varsg,...,Vars x, each of size at 
most [n/k]. For every part Vars; and each of 2lVa7s:| possible assignments 
Y: Varsi > {L, T}, create a vertex vý. For each i € [k], create an additional 
vertex w’ and let V; be the set of vertices vi, created for the part Vars; plus 
the vertex wt. Make V; into a clique. For each clause C of y, create a vertex 
uc. For every i € [k] and every assignment 7 of Vars;, create an edge ucry, 
if and only if Vars; contains a variable x; whose literal appears in C, and 
moreover 7 evaluates x; so that this literal satisfies C. Let G be the resulting 
graph and let N = |V(G)|. Observe that N < k-(14+2!"/'l) +m = O*(2"/*), 
so G contains at most (Y) < O*(2?"/*) < O*(2?"/3) edges. Moreover, G can 
be constructed in time O*(2?"/") < O*(22”/3), 


The intuition is as follows. Vertices w; ensure that in the optimum so- 
lution we will need to pick exactly one vertex from every clique V;. The 
vertex picked from V; encodes the assignment of variables of Vars;, and 
domination of each vertex uc checks satisfaction of the corresponding 
clause C. Thus, the space of all the Ma |V;| k-tuples that are candi- 
dates for a dominating set in G corresponds to the space of all the 2” 
variable assignments for the input formula g. 
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We now claim that G admits a dominating set of size k if and only if ọ is 
satisfiable. If p admits a satisfying assignment Y: Vars —> {1, T}, then let 
us construct a set X = {oy : i € [k]}. Observe that |X| = k and X isa 


dominating set in G: each clique V; is dominated by viy? while each vertex 


uc is dominated by the vertex Vig for any part V; that contains a variable 


whose literal satisfies C. 

Conversely, let X be a dominating set of size k in G. Since every vertex 
wt is dominated by X and N[w;] = V;, we infer that X contains exactly one 
vertex from each clique V;, and no other vertex. Let us define assignment 
wi: Varsi; > {L,T} for each i € [k] as follows: if X NV; = {wi} then put 
an arbitrary assignment as p;i, and otherwise p; is such that X A V; = {vi }- 
Let Yy = Uie] Yi; we claim that the assignment w satisfies the input formula 
vy. Take any clause C and examine the vertex uc. Since uc ¢ X, uc is 
dominated by a vertex Uy, € X for some i € [k]. By the definition of the edge 
set of G we infer that Vars; contains a variable whose literal is present in C, 
and which is evaluated under the assignment 7; so that C becomes satisfied. 
Since w; = U|vars,, the same literal satisfies C in the assignment w. 

To conclude the proof, observe that we could solve the CNF-SAT prob- 
lem by pipelining the presented reduction with the supposed algorithm for 
DOMINATING SET on instances with parameter k. As argued, the reduction 
runs in time O*(2?"/3), while the application of the algorithm for DOMINAT- 
ING SET takes time O(N*-*) = 2% G-«/*) . (n + m)O™. Since e > 0 and k 
is a constant, this means that CNF-SAT can be solved in time O*(2°”) for 
d=1-_<1. 














Note that the trivial algorithm for DOMINATING SET that verifies all k- 
tuples of vertices runs in time O(N*-(N-+M)) on graphs with N vertices and 
M edges. However, it is known that this algorithm can be slightly improved, 
to running time O(N*+%(). Thus, Theorem [14.42] essentially states that it 
is hard to break the barrier of the cardinality of the brute-force search space. 


Exercises 


14.1 (Æ). Prove the randomized analogues of Theorems and 


(a) Assuming randomized ETH, there exists a constant c > 0 such that no randomized 
two-sided error algorithm for 3-SAT can achieve running time O*(2¢"+™)), 
(b) Randomized SETH implies randomized ETH. 


14.2. Prove Theorem that is, verify that the following problems admit linear reduc- 
tions from 3-SAT: 


(a) VERTEX COVER (&), 

(b) Dominatine SET (&), 

(c) FEEDBACK VERTEX SET (Æ), 
(d) 3-COLORING, 
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(e) HAMILTONIAN CYCLE. 


14.3. Prove Theorem that is, verify that the following problems admit reductions 
with linear output size from the PLANAR 3-SAT problem (assuming the existence of a 
variable circuit, if needed): 


a) PLANAR VERTEX COVER, 

b) PLANAR DOMINATING SET, 

c) PLANAR FEEDBACK VERTEX SET, 
d) PLANAR 3-Cotorine (&), 

(e) PLANAR HAMILTONIAN CYCLE (8). 


( 
( 
( 


14.4 (&). Using Theorem|14.13]as a black box, prove that the existence of an O* (2°(# log k)) 
algorithm for k x k PERMUTATION CLIQUE would contradict randomized ETH. 


14.5. In the 2k x 2k BIPARTITE PERMUTATION INDEPENDENT SET we are given a graph 
G with the vertex set [2k] x [2k], where every edge is between Iı = [k] x [k] and Ig = 
([2k] \ [k]) x ([2k] \ [k]). The question is whether there exists an independent set X C I Ul2 
in G that induces a permutation of [2k]. Construct a polynomial-time reduction that takes 
an instance of k x k PERMUTATION CLIQUE and outputs an equivalent instance of 2k x 2k 
BIPARTITE PERMUTATION INDEPENDENT SET. Infer that 2k x 2k BIPARTITE PERMUTATION 
INDEPENDENT SET does not admit an O*(2°(* log k)) algorithm unless ETH fails. 


14.6. Construct a polynomial-time reduction that takes an instance of 2k x 2k BIPARTITE 
PERMUTATION INDEPENDENT SET and outputs an equivalent instance of 2k x 2k PERMU- 
TATION HITTING SET WITH THIN SETS. Infer that k x k PERMUTATION HITTING SET WITH 
THIN SETS does not admit an O*(2°(k log k)) algorithm unless ETH fails, which is the first 
half of Theorem 


14.7 (Æ). Construct a polynomial-time reduction that takes an instance of k x k PERMU- 
TATION HITTING SET WITH THIN SETS and outputs an equivalent instance of k x k HITTING 
SET WITH THIN SETS. Infer that k x k HITTING SET WITH THIN SETS does not admit an 
O* (20k loz *)) algorithm unless ETH fails, which is the second half of Theorem|14.16| 


14.8 ($). In the Maximum CycLE Cover problem the input consists of a graph G and 
an integer k, and the question is whether one can find a collection of at least k vertex- 
disjoint cycles in the graph so that every vertex belongs to exactly one of these cycles. 
Prove that unless ETH fails, Maximum CycLE Cover does not admit an FPT algorithm 
with running time O*(2°(P log P)), where p is the width of a given path decomposition of 
G. 


14.9. In the CoMPONENT ORDER INTEGRITY problem we are given a graph G and two 
integers k and £. The task is to determine whether there exists a set of vertices X of cardi- 
nality at most k such that every connected component of G\ X contains at most £ vertices. 
Prove that COMPONENT ORDER INTEGRITY admits an FPT algorithm with running time 
O*(20(klog£)), but the existence of an algorithm with running time O*(2°(* los £)) would 
contradict ETH. 


14.10 (È). Using Theorem [14.20] as a black box, prove the following statement: There 
exists a universal constant 6 > 0 such that, unless P Æ NP, there is no constant A and a 
kernelization algorithm for EDGE CLIQUE Cover that reduces the number of vertices to 
at most \- 2°*. In particular, Enee Cirque Cover does not admit a 2°(*) kernel unless 
P=NP. 


14.11 (&). Modify the construction in the proof of Theorem]13.31|to get a parameter- 
ized reduction from SUBGRAPH ISOMORPHISM to ODD SET where the parameter of the 
constructed instance is linear in the number of edges of H in the SUBGRAPH ISOMORPHISM 
instance. 
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14.12 (&). Modify the construction in the proof of Theorem|13.33]to get a parameterized 
reduction from SUBGRAPH ISOMORPHISM tO STRONGLY CONNECTED STEINER SUBGRAPH 
where the parameter of the constructed instance is linear in the number of edges of H in 
the SUBGRAPH ISOMORPHISM instance. 


14.13. The input of the UNIT SQUARE INDEPENDENT SET problem is a set of axis-parallel 
unit squares in the plane and an integer k; the task is to select a set of k pairwise disjoint 
squares. Show that UNIT SQUARE INDEPENDENT SET is W[1]-hard and, unless ETH fails, 
it has no f(k)n°(Y*)-time algorithm for any computable function f. 


14.14 (Æ). Prove Claim|14.39 
14.15 (Æ). Prove Claim|14.40 


14.16. In the NAE-SAT problem the input consists of a formula ọ in the conjunctive 
normal form (CNF), and the question is whether there exists an assignment of the variables 
such that in each clause of ọ there is at least one literal evaluated to true and at least one 
evaluated to false. Prove that, unless SETH fails, for any € > 0 there does not exist an 
algorithm that solves NAE-SAT in time O*((2—¢)”), where n is the number of variables 
appearing in y. 


14.17. Show that the standard reduction from the g-SAT problem to INDEPENDENT SET 
proves that unless SETH fails, for any £ > 0 there is no O*((/2 — <)?)-time algorithm for 
the INDEPENDENT SET problem given a path decomposition of width p. 


14.18 (È). In the CONNECTED DOMINATING SET problem the input consists of a graph 
G and an integer k, and the question is whether there exists a subset X C V(G) of at most 
k vertices such that G[X] is connected and N[X] = V(G). Prove that, unless SETH fails, 
for any £ > 0 there does not exist an algorithm that, given a CONNECTED DOMINATING 
SET instance (G,k) together with a path decomposition of G of width p, solves (G,k) in 
time O*((4—)?). 


14.19. Prove that, given a graph G, one can check whether G has a dominating set of size 
at most 2 in O(|V(G)|”) time, where w is the matrix multiplication exponent. 


Hints 


Follow the lines of the proofs of Theorems and but, whenever some algo- 
rithm Ae is used, repeat its usage many times in order to reduce the error probability. 


Look at classic NP-hardness reductions for these problems that can be found in the 
literature. 


(14:3) 


(a) Replace variables and clauses by gadgets, but watch out not to spoil planarity around 
a variable gadget when edges corresponding to positive and negative occurrences in- 
terleave around the vertex corresponding to the variable. To fix this, use a cycle of 
length 2p for a variable gadget, where p is the number of occurrences of the variable. 

(b) Design a similar construction as for PLANAR VERTEX COVER, but use a cycle of length 
3p. 

(c) Modify slightly the construction for PLANAR VERTEX COVER by applying the standard 
reduction from VERTEX COVER to FEEDBACK VERTEX SET at the end. 
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Fig. 14.10: The ladder graph 


(d) Use the variable circuit x1 — £2 — . . .— Zn — £1 to propagate colors chosen to represent 
true and false assignment. Carefully design a planar variable gadget of size O(p), 
where p is the number of occurrences of the variable. Make sure that your gadget both 
propagates colors representing true and false to the next variable, and also makes use 
of them in the check against the clause gadget. The clause gadget can be of constant 
size. 

(e) Use the variable circuit xı —x2—...—£n— £1 as the ‘spine’ of the intended Hamiltonian 
cycle. The Hamiltonian cycle should be able to make small detours from the spine into 
clause gadgets, and all clause gadgets can be visited if and only if the input formula 
is satisfiable. Use the ladder graph depicted in Fig. as your variable gadget. 
In the proof of correctness you will need to be very careful when arguing that the 
Hamiltonian cycle cannot misbehave and jump between distant variable gadgets via 
clause gadgets. 


Start with the randomized version of Theorems i.e., Exercise (a). and verify 


that the chain of reductions to k x k CLIQUE via 3-COLORING gives also a randomized lower 
bound. 


14.5] By taking the complement of the graph, you may start with k x k PERMUTATION 
INDEPENDENT SET. In the constructed instance of 2k x 2k BIPARTITE PERMUTATION IN- 
DEPENDENT SET, the only possible independent sets should induce the same permutation 
pattern on J; and on I2, and this permutation pattern should correspond to a solution to 
the input instance of k x k PERMUTATION INDEPENDENT SET. 


Encode the constraint imposed by each edge of G using one thin set. Remember to 
force the solution to be contained in Jı U I2, which can also be done using thin sets. 


The requirement that each column must contain at least one vertex of the solution 
can easily be encoded using thin sets. 


Follow the strategy sketched in the paragraph after Theorem[14.19] Start from k x k 
HITTING SET WITH THIN SETS and encode choice of the solution as a choice of a perfect 
matching in a Ky ,-biclique in the first bag. The main spine of the decomposition should 
be formed by 2k paths originating in the 2k vertices of the biclique; along these paths k 
cycles should ‘travel’ at each point. A gadget verifying a set should allow closing one cycle 
and beginning a new one in its place if and only if the set is hit by the solution. 


14.9] For the positive result, apply a simple branching strategy. For the negative result, 
start the reduction from k x k CLIQUE and have £ = k0), 


Since EDGE CLIQUE Cover is in NP, there exists a polynomial-time reduction 
from EDGE CLIQUE Cover to 3-SAT. Consider the composition of (a) the reduction of 
Theorem (b) a supposed kernelization algorithm, and (c) the reduction back from 
EDGE CLIQUE CovER to 3-SAT. Calculate an upper bound on the size of the output 
instance in terms of the input one. 


14.11) Let v1, ..., vjv(m)] be the vertices of H. We include F;,; in the universe only if v; 
and vj are adjacent. The parameter is set to k’ = k+|E(H)|. 
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14.12) Let vi, ..., Vivi); be the vertices of H. We include the vertex y;,; and the 
set of vertices E; , in G’ only if v; and vj are adjacent. The parameter is set to k’ = 
k+1+2\|E(H)| £ |K| +k + |E(H)]|. 


14.13] The reduction in the proof of Theorem [14.34] does not work with squares: the 
squares at (x +ae,y+ae) and (x +1+ac,y+1+ae) can intersect at their corners, which 
was not an issue for unit disks. We show two possible workarounds. 

Solution 1: Represent each pair (a,b) E€ S[x, y] with the five squares at (3x +ae, 3y+ae), 
(3x +1+ae,3y+ae), (3x + ac, 3y + 1 + ae), (3x — 1 + ae, 3y + ae), (3x + ae, 3y — 1 + ae). 
Set the new parameter to k’ = 5k. 

Solution 2: Represent each pair (a,b) € S[x, y] with a square at (xz+ae—y/2, y+ae+a/2). 








14.17| Create a clique of size q for each clause and a clique of size 2 for each variable. 
Connect each vertex of each clique of size q with its negation in the corresponding clique 
of size 2. Prove that the resulting graph has pathwidth at most 2n + O(1) for constant q. 


As in the lower bound for INDEPENDENT SET, the main part of the construction 
consists of roughly p long “strings” that transmit a fixed choice of an assignment ~. How- 
ever, as we aim at s (4 — £)P lower bound, each “string” needs to transmit information 
about the assignment of two variables. In the CONNECTED DoMINATING SET problem, the 
four states of a vertex on a string may be chosen as follows: (1) not in the solution, but 
dominated; (2) not in the solution, and not yet dominated; (3) in the solution, and already 
connected to some “root vertex” of the constructed solution”; and (4) in the solution, but 
not yet connected to the “root vertex”. 
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Theorem are due to Marx [349]. 

Alber and Fiala [9] presented an nO(Vk) time algorithm for Unit Disk INDEPENDENT 
SET using a geometric separator theorem. However, as discussed by Marx and Sidiropou- 
los [359], one can obtain this result by a simple application of the shifting technique. The 
W[1]-hardness of Unit Disk INDEPENDENT SET was proved by Marx [345], but with a 
reduction that increases the parameter more than quadratically, hence giving only a lower 
bound weaker than the one stated in Theorem[14.34] The Grib Tilna problem was for- 
mulated by Marx [348], obtaining the tight lower bound of Theorem for Unit Disk 
INDEPENDENT SET, as well as an analogous result for UNIT SQUARE INDEPENDENT SET. 
The lower bound of Theorem [14.32] and the algorithm of Theorem [14.31] for SCATTERED 
Set is by Marx and Pilipczuk [355]. The Grip TILING problem was used by Marx to 
show that, assuming ETH, there is no f(k) - nV0)-time algorithm for EDGE MULTICUT 
on planar graphs with @ terminals for any computable function f, essentially matching 
the 20 .nO(V%-time algorithm of Klein and Marx [295]. Chitnis, Hajiaghayi, and Marx 
used Grip TILING to prove that, assuming ETH, there is no f(k) - no(Vk)-time al- 
gorithm for STRONGLY CONNECTED STEINER SUBGRAPH on planar graphs, parameterized 
by the number k of terminals, for any computable function f. Additionally, they presented 
a 2O(k log k) . pOl(Vk) -time algorithm for planar graphs (on general graphs, an earlier al- 
gorithm of Feldman and Ruhl solves the problem in time n©(*) time). Marx and 
Pilipczuk presented a number of W/[1]-hardness results for various parameterizations 
of SUBGRAPH ISOMORPHISM; many of these reductions start from Grip TILING. The GRID 
TILING WITH < problem was formalized by Marx and Sidiropoulos [359], who also studied 
higher-dimensional generalizations of Grip TILING and Grip TILING WITH < and obtained 
tight lower bounds for higher-dimensional geometric problems. 

The lower bound for HITTING SET parameterized by the size of the universe was given by 
Cygan, Dell, Lokshtanov, Marx, Nederlof, Okamoto, Paturi, Saurabh, and Wahlström [112]. 
The work of Cygan et al. contains a much larger net of reductions, which links many classic 
parameterized problems either to SETH or to the Set Cover Conjecture (Conjecture[14.36). 
In particular, the Set Cover Conjecture is formulated in [112]. Notably, the authors consider 
also parity variants of the considered problems. In this setting the analogue of the Set Cover 
Conjecture is indeed equivalent to the analogue of SETH, which provides further evidence 
in favor of the Set Cover Conjecture. 

The framework for proving SETH-based lower bounds for treewidth/pathwidth param- 
eterizations was introduced by Lokshtanov, Marx, and Saurabh [325]. In particular, the 
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presented lower bound for INDEPENDENT SET (Theorem [|14.38) is due to them. The first 
three lower bounds of Theorem |14.41] were also obtained in [325], while the next three, 
as well as that of Exercise|14.18] was given by Cygan, Nederlof, Pilipczuk, Pilipczuk, van 
Rooij, and Wojtaszczyk as complementary results for their Cut & Count technique. 
The last lower bound, for HAMILTONIAN CYCLE, was given by Cygan, Kratsch, and Ned- 
erlof [113]. Interestingly, in the same work the authors give a matching algorithm with 
running time O*((2 + V2)”), where p is the width of a given path decomposition. 

An O(N*+°x()) algorithm for DominatiINa SET was proposed by Eisenbrand and 
Grandoni [163]. The matching lower bound under SETH, i.e., Theorem [14.42] together 
with the algorithm of Exercise |14.19} is due to Pătraşcu and Williams [390]. Interestingly, 
Pătraşcu and Williams viewed this theorem not as a lower bound, but rather as a possible 
approach to obtaining faster algorithms for CNF-SAT. 


Chapter 15 
Lower bounds for kernelization 


In this chapter we discuss methodologies for proving 

lower bounds for kernelization. We describe the tech- 

nique of compositionality, which is the main tool for ° 
proving negative results about the existence of poly- 
nomial kernels. We also give an introduction to weak 
compositions, which can be used to provide more pre- 
cise lower bounds on kernel sizes. 





In Chapters [f] [9] and [12] we have learned many different approaches to 
designing efficient kernelization algorithms. While some parameterized prob- 
lems are naturally amenable to various preprocessing techniques, we have so 
far not discussed what properties of a given problem make it resistant from 
the point of view of efficient kernelization. In Chapter [2] we have seen that 
the existence of any kernelization algorithm is equivalent to the existence of 
a fixed-parameter tractable algorithm for a problem. Therefore, the theory 
of parameterized intractability that was discussed in Chapter [13] can be used 
to investigate whether a problem admits any kernel at all. This answer is, 
however, quite unsatisfactory: we have argued that the interesting kerneliza- 
tion algorithms are those that have good (say, polynomial) guarantees on the 
output size, and for many problems such algorithms can be indeed designed. 
Under what conditions and what assumptions can we refute the existence of 
a polynomial kernel for a particular fixed-parameter tractable problem? 

The best known answer to this question so far comes via the framework 
of compositionality. The development of this technique around the year 2008 
greatly increased the interest in kernelization as a research area. Indeed, the 
existence of a methodology for proving negative results about kernelization 
made it possible to perform a systematic classification of problems into those 
that do admit polynomial kernels, and those that probably do not. Before this 
breakthrough, one could only investigate how different combinatorial prop- 
erties of problems can be used in designing efficient preprocessing routines, 
but understanding their limitations was out of reach. 
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In this chapter we introduce the technique of compositionality. Fortunately, 
the complexity foundations of the framework are quite simple to explain, so 
many of the fundamental results to be presented will be proved in a self- 
contained way. However, our main goal in this chapter, as in Chapters 
and [14] is to provide a basic practitioner’s toolbox. Therefore, after presenting 
the complexity foundations for the technique we illustrate its usage with a 
few selected examples that reflect various scenarios appearing in “real life”. In 
particular, we discuss the notion of cross-composition, which is a convenient 
formalism for applying compositionality; we describe different strategies for 
designing compositions, focusing on the concept of an instance selector; and 
we present polynomial parameter transformations, which serve the role of 
reductions in the world of kernelization lower bounds. 

Finally, at the end of this chapter we present the technique of weak com- 
positions. It is a variant of compositionality, using which one can prove lower 
bounds on kernel sizes for problems that actually do admit polynomial ker- 
nels. We provide full details of the two probably most important results of this 
area: that the VERTEX COVER and FEEDBACK VERTEX SET problems do not 
admit kernels with bitsize O(k?~*) for any € > 0, unless NP ¢ coNP/ poly. 


15.1 Compositionality 


Since the principles of compositionality are very natural, whereas the formal 
layer can be more challenging to understand, we first give a description of 
the intuition behind the methodology before going into formal details. 

Let us take as the working example the LONGEST PATH problem: given 
a graph G and an integer k, we ask whether G contains a simple path on 
k vertices. Assume for a moment that this problem admits a kernelization 
algorithm that reduces the number of vertices to at most. k. More formally, 
given an instance (G, k) it outputs an equivalent instance (G’,k’) such that 
|V(G’)|, k’ < k’. Suppose somebody gave us k” instances of LONGEST PATH 
with the same parameter k; denote them by (G1, k), (G2, k), ..., (Gkr, k). 
Consider now a graph H obtained by taking the disjoint union of graphs 
G1,Go,...,G,7, and observe that H contains a path on k vertices if and only 
if at least one of the graphs G',G2,...,G ,7 does. In other words, the answer 
to the instance (H, k) is equal to the logical OR of the answers to instances 
(G1, k), (Go, k),..., (Ger, k). We will also say that the instance (H, k) is an 
OR-composition of instances (G1, k), (G2, k), ..., (Gkr, k). 

Now apply the assumed kernelization algorithm to the instance (H, k), 
and obtain some new instance (H’,k’) such that |V(H’)|,k’ < k’. Observe 


that (H', k’) can be encoded in roughly k8 /2 + 3log k bits: we can encode 
the adjacency matrix of H’ in (£) bits, while the encoding of k’ takes 3 log k 


bits. This is, however, much less than k”, the number of the input instances! 
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Therefore, during the kernelization process the algorithm must have 
‘forgotten’ information about a vast majority of instances. The desired 
k-path, however, can lure in any of them. Losing information about 
this instance can change the global answer to the problem, if it is the 
only one with the positive outcome. Consequently, it seems that the 
kernelization algorithm would need to be able to solve some of the given 
instances, or at least to reason about which of them are less prone to 
having a positive answer and can be safely discarded. Such an ability 
would be highly suspicious for an algorithm running in polynomial time, 
since the LONGEST PATH problem is NP-hard. 


Of course, there is nothing magical in the numbers 3 and 7 that we picked 
in this example, and the reasoning is much more general: a kernelization 
algorithm for LONGEST PATH with polynomial guarantees on the output size 
would need to discard some information that it should not be able to identify 
as useless. Our goal is therefore to put into formal terms why the described 
behavior is unlikely, and link this understanding to some well-established 
assumptions from complexity theory. 


15.1.1 Distillation 


We first aim to formally capture the intuition that some information has to 
be lost when packing too many instances into a too small space in the kernel. 
In order to achieve this, we introduce the concept of distillation, which will 
be a result of pipelining a composition for a parameterized problem with a 
hypothetical polynomial kernel for it. Thus, by (a) designing a composition, 
and (b) proving that the existence of a distillation is unlikely, we will be 
able to refute the existence of a polynomial kernel. This section is devoted 
to understanding issue (b). Since parameterizations will be needed only to 
formally capture the concept of kernelization, the definition of distillation 
considers classic (that is, not parameterized) languages. 

Let X be some fixed, constant-size alphabet that we will use for encoding 
the instances. 


Definition 15.1. Let L, R C X* be two languages. An OR-distillation of L 
into R is an algorithm that, given a sequence of strings 71, £2,..., £4 E X*, 
runs in time polynomial in as |x;| and outputs one string y € X* such 
that 


(a) |y| < p(maxt_, |x;|) for some polynomial p(-), and 
(b) y € R if and only if there exists at least one index i such that x; € L. 
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Thus, the answer to the output instance of R is equivalent to the logical 
OR of the answers to the input instances of L. 

In what follows we will be considering the complexity class coNP/ poly. We 
assume that the reader is familiar with the concepts of co-nondeterminism 
and nonuniform complexity classes; we refer her or him to any textbook on 
complexity theory for an introduction to these topics. However, for the sake 
of clarity we recall now the definition of coNP/ poly. 


Definition 15.2. We say that a language L belongs to the complexity 
class coNP/ poly if there is a Turing machine M and a sequence of strings 
(Qn )n=0,1,2,.... Called the advice, such that: 


e Machine M, when given input x of length n, has access to string a, and 
has to decide whether x € L. Machine M works in co-nondeterministic 
polynomial time. That is, it makes a polynomial number of steps that may 
be chosen co-nondeterministically: If « € L, the algorithm should derive 
this conclusion for every possible run, whereas if x ¢ L, then at least one 
run needs to finish with this conclusion. 

e |a,| < p(n) for some polynomial p(-). 


Note that in this definition a,, depends on n only; in other words, for each 
n we need to design a “global” advice a,, that will work for all the inputs of 
length n. We are now ready to state the crucial result standing behind the 
compositionality framework. 


Theorem 15.3. Let L,R C X* be two languages. If there exists an OR- 
distillation of L into R, then L € coNP/ poly. 


Before we proceed to the proof, let us see the consequences of the theorem 
in the form of the following corollary. In the following, whenever we talk 
about NP-hardness we mean NP-hardness with respect to Karp reductions. 


Corollary 15.4. If an NP-hard language L C &* admits an OR-distillation 
into some language R C &*, then NP C coNP/ poly. 


Proof. For any language L’ € NP we can construct an algorithm resolving 
membership in L’ in coNP/ poly as follows. We first apply the NP-hardness 
reduction from L’ to L, and then run the algorithm resolving membership in 
L in coNP/ poly implied by Theorem [15.3] Consequently NP C coNP/ poly, 
which is a highly unexpected outcome from the point of view of the complex- 
ity theory. 














The assumption that NP É coNP/ poly may be viewed as a stronger vari- 
ant of the statement that NP 4 coNP. The latter claim means that verifica- 
tion of certificates in polynomial time cannot be simulated by verification of 
counterexamples in polynomial time. In the claim that NP É coNP/ poly 
we assume further that such a simulation is impossible even if the co- 
nondeterministic polynomial-time algorithm resolving a problem in NP is 
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given access to some polynomial-size advice string that depends on the input 
length only. It is known that NP C coNP/ poly implies that X? = PH, that 
is, the polynomial hierarchy collapses to its third level. This is not as dramatic 
a collapse as P = NP, but it is serious enough to be highly implausible. 

Note here that we do not assume anything about the target language R 
— it can be even undecidable. Therefore, in some sense the theorem has 
an information-theoretical character. It shows that the possibility of packing 
a lot of information into small space gives rise to a surprising algorithm 
resolving the problem, lying roughly on the opposite side of the polynomial 
hierarchy than expected. The way of organizing the information in the output 
instance is not relevant at all. 


Proof (of Theorem|15.3). Without loss of generality assume that X = {0,1}; 
this can be easily done by encoding symbols of a larger alphabet using 
constant-length binary strings. Let A be the assumed OR-distillation algo- 
rithm, and let p(-) be a polynomial upper bounding the length of the output 
of A. Without loss of generality we may assume that p(-) is nondecreasing. 
Let K = p(n) so that algorithm A run on a sequence of strings, each of length 
at most n, outputs a string of length at most K. Let us fix t = K +1. Thus, 
algorithm A maps the set D = (X’S")* of t-tuples of input strings into the 
set YSK 

Let now A = LA XS” and A = YS" \ L be the sets of yes- and no- 
instances of L of length at most n, respectively. Similarly, let B = RN YSK 
and B = YS* \ R. By the assumption that A is an OR-distillation, we have 
that A maps (A)! into B, and D \ (A)* into B. 

For strings x € YS" and y € YSK, we will say that x is covered by y if 
there exists a t-tuple (a1, £2,..., £4) E€ D such that (a) x = x; for some i, 
and (b) A(a1,22,...,24) = y. In other words, x is contained in some tuple 
from the preimage of y. More generally, for X C YS" and Y C YSK we will 
say that X is covered by Y if every element of X is covered by at least one 
element of Y. The following claim will be the core argument in the proof. 


Claim 15.5. There exists a set Y C B such that 


(a) |Y| <n+1, and 
(b) Y covers A. 


Since A~1(B) = (A)‘, no element of B can cover any string outside A. 
Of course, B itself covers A, but the size of B can be huge in terms 
of n. The gist of the proof is that we can find a covering set of size 
polynomial in n, rather than exponential. 





1 For a set S, notation St and SS+ denotes the set of sequences of elements from S that 
have length exactly t, resp. at most t. 
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Before we proceed to the proof of Claim let us shed some light on 
its motivation by showing how it implies Theorem That is, we would 
like to prove that L € coNP/ poly. According to the definition, we need to 
construct (a) an algorithm resolving membership in L in co-nondeterministic 
polynomial time, and (b) a sequence of advice strings an for n = 0,1,2,... 
that will be given to the algorithm. 

In our case, advice a, will be an encoding of the covering set Y. Since Y 
contains at most n+ 1 strings, each of length at most K, the advice string a, 
will be of length polynomial in n. The algorithm works as follows. Given the 
input string x of length n, it tries to prove that x ¢ L. If this is the case, then 


there exists a tuple (£1, £2,..., £4) E€ D such that x = zx; for some i, and 
A(a1,¥2,...,24) E€ Y. The algorithm guesses co-nondeterministically this 
tuple, computes A(x, 22,...,24), and checks whether the result is contained 


in the advice string an. If indeed x ¢ L, then for at least one guess we will 
compute a string belonging to Y, which is a certificate that x ¢ L since 
Y C B. If x € L, however, then by the definition of an OR-distillation every 
tuple containing x is mapped to a string contained in B, so in particular 
outside of Y. 

Let us examine how the polynomial advice is used in this argument: 
Claim [15.5] provides us only a proof of the existence of a small covering set 
Y, and no means of computing it efficiently. Indeed, computation of this set 
would be extremely difficult, as it depends on the actual behavior of algorithm 
A on all the inputs from D. Instead, this set is provided to the algorithm in 
the advice string, so there is no need to construct it. The crux is that the 
constructed covering set Y works for all the input strings x of length n, and 
hence we can have the same advice for all the inputs of the same length. 

We now proceed to the proof of Claim[I5.5] We will consecutively construct 
strings y1, Y2, Y3,..-. E B up to the point when the set Y; = {y1,yo,.--, yi} 
already covers the whole A. Let S; be the set of strings from A that are 
not covered by Y;; we have that So = A. During the construction we will 
guarantee that |S;| < A, which is of course satisfied at the beginning. Since 
|A| < |XS”| < 2”+1 (here we use the assumption that |X| = 2), the construc- 
tion will terminate after at most n + 1 steps with S; being empty. 

It remains to show how to construct string y; based on the knowledge of 
Y;—ı. Recall that algorithm A maps every tuple from the set (Si—1)t C (A) 
into B. Since |B| < |ZSK| < 2*+1, it follows by the pigeonhole principle that 
there exists some y € B such that AT! (y) N (Si—1)t| > Sin = (S). 
Observe now that every string from $;_, that is contained in any tuple from 
A~!(y)A(S;_1)! is covered by y. Therefore, if we set y; = y, then every string 
from every tuple from the set A~!(y)N($;-1)¢ is contained in S;—1 \ Sj, since 
it gets covered. Consequently A~!(y) N (Si-1)* C (Si-1 \ Si)’, and hence 





(£=) < [ATHA (Siz) S| \ S| = bean \ Sl. 
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Si-1 \ Si mu 


Fig. 15.1: The core combinatorial argument in the proof of Claim [15.5] if a 
small t-dimensional product hypercube is contained in a large t-dimensional 
product hypercube, and it contains at least a x fraction of all the tuples, 
then the side of the small hypercube must be at least half as long as the side 
of the large one. The balls within the small hypercube depict tuples from 


ATTY) N (Si-1)* 


We infer that |S;—1 \ 9:| > LŽ, which means that |S;| < |$;-1|/2 and the 


condition |S;| < Al follows by a trivial induction. As we have argued, this 
means that the construction will terminate after at most n + 1 steps, yielding 


a feasible set Y. This concludes the proof of Claim and of Theorem 

















15.1.2 Composition 


We now formalize what it means to compose instances. Recall that in the 
LONGEST PATH example we assumed that all the input instances have the 
same parameter. It appears that we can make even stronger assumptions, 
which is formalized in the following definition. 


Definition 15.6. An equivalence relation R on the set X* is called a poly- 
nomial equivalence relation if the following conditions are satisfied: 


(a) There exists an algorithm that, given strings z, y € X*, resolves whether 
£ Ær y in time polynomial in |z| + |y]. 

(b) Relation R restricted to the set XS” has at most p(n) equivalence classes, 
for some polynomial p(-). 


Assume, for instance, that the input to our problem is some fixed encoding 
of an undirected graph. We can then define R as follows. First, we put into one 
equivalence class all the strings from &* that do not encode any graph. We 
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call such instances malformed, while all the other instances are well-formed. 
Next, note that we can choose the encoding in such a manner that graphs 
with encoding of length at most n have at most n vertices and at most n 
edges. Therefore, if we say that two well-formed instances are R-equivalent 
if the corresponding graphs have the same numbers of vertices and edges, 
then XS” is divided by R into at most n? + 1 classes (+1 comes from the 
malformed class) and condition (b) is satisfied. Of course, provided that we 
chose a reasonable encoding, condition (a) holds as well. If the input to the 
problem was a graph and an integer k, encoded in unary, then we could in 
addition refine R by requiring that the instances x and y have the same value 
of k — we would just increase the bound on the number of equivalence classes 
in YS” from n?+1 to n?+1. In the later sections we will see that the ability 
to make such assumptions about the input instances can be very useful for 
streamlining the composition. 
We now proceed to the main definition. 


Definition 15.7. Let L C X* be a language and Q C X* x N be a pa- 
rameterized language. We say that L cross-composes into Q if there exists 
a polynomial equivalence relation R and an algorithm A, called the cross- 
composition, satisfying the following conditions. The algorithm A takes as 
input a sequence of strings £1, %2,..., %4 E X* that are equivalent with re- 
spect to R, runs in time polynomial in S |x;|, and outputs one instance 
(y, k) € X* x N such that: 


(a) k < p(max'_, |x;| + logt) for some polynomial p(-), and 
(b) (y, k) € Q if and only if there exists at least one index i such that z; € L. 


Note that in this definition, contrary to OR-distillation, it is only the out- 
put parameter that is small, while the whole output string y may be even 
as huge as the concatenation of the input instances. Moreover, the output 
parameter can also depend poly-logarithmically on the number of input in- 
stances. This seemingly innocent nuance will be heavily exploited in the later 
sections. 

To give an example, observe that the HAMILTONIAN PATH problem cross- 
composes into LONGEST PATH parameterized by the requested path length 
as follows. For the equivalence relation R we take a relation that puts all 
malformed instances into one equivalence class, while all the well-formed in- 
stances are partitioned with respect to the number of vertices of the graph. 
The cross-composition algorithm, given a bunch of malformed instances, re- 
turns some trivial no-instance of LONGEST PATH, while given a sequence of 
graphs on n vertices, it returns the encoding of their disjoint union together 
with parameter k = n. For a reasonable encoding we have that n is bounded 
by the maximum length of an input string, and so condition (a) holds. Clearly, 
the disjoint union of graphs on n vertices has an n-vertex path if and only if 
one of the input graphs admitted a Hamiltonian path. Hence condition (b) 
is satisfied as well. 
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As the reader expects, pipelining the cross-composition from an NP-hard 
problem with a polynomial kernel will yield an OR-distillation of the NP- 
hard problem, implying that NP C coNP/ poly. However, since the output 
of an OR-distillation can be an instance of any language, we will be able to 
refute the existence of a larger class of preprocessing routines. 


Definition 15.8. A polynomial compression of a parameterized language 
Q C X* x N into a language R C X* is an algorithm that takes as input 
an instance (x, k) € X* x N, works in time polynomial in |z| + k, and returns 
a string y such that: 


(a) |y| < p(k) for some polynomial p(-), and 
(b) y € R if and only if (z, k) € Q. 


If |X| = 2, the polynomial p(-) will be called the bitsize of the compression. 


Whenever we talk about the existence of a polynomial compression and we 
do not specify the target language R, we mean the existence of a polynomial 
compression into any language R. 

Obviously, a polynomial kernel is also a polynomial compression by treat- 
ing the output kernel as an instance of the unparameterized version of Q. 
Here, by an unparameterized version of a parameterized language Q we mean 
a classic language Q C X* where the parameter is appended in unary af- 
ter the instance. The main difference between polynomial compression and 
kernelization is that the polynomial compression is allowed to output an in- 
stance of any language R, even an undecidable one. Clearly, if R is reducible 
in polynomial time back to Q, then pipelining the compression with the re- 
duction gives a polynomial kernel for Q. However, R can have much higher 
complexity than Q, and there are examples of natural problems for which 
there exists a polynomial compression, but a polynomial kernel is not known. 

We are now ready to state and prove the main theorem of this section. 


Theorem 15.9. Assume that an NP-hard language L cross-composes into a 
parameterized language Q. Then Q does not admit a polynomial compression, 
unless NP C coNP/ poly. 


Proof. Let A be a cross-composition of L into Q, let po(-) be the polyno- 
mial bounding the parameter of the output instance of A, and let R be the 
polynomial equivalence relation used by A. Assume further that Q admits a 
polynomial compression C into some language R. Let OR(R) be a language 
consisting of strings of the form d\#do#...7#d, such that for at least one 
index i it holds that d; € R. Here, # is some special symbol that is artificially 
added to X and is not used in any string from R. We are going to construct 
an OR-distillation of L into OR(R), which by Corollary [15.4] implies that 
NP C coNP/ poly. The algorithm is depicted in Fig. 

Let £1, 22,...,24 be the sequence of input strings and let n = max'_, |z;]. 
Apply the following polynomial-time preprocessing: examine the strings pair- 
wise and remove all the duplicates. Note that this step does not change 
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Fig. 15.2: The workflow of the algorithm in the proof of Theorem 


whether at least one of the strings belongs to L. Observe that the number of 
different strings over X of length at most n is bounded by X ;—o |X)’ < |2|. 
Hence, after removing the duplicates we have that t < |X|” +t, and so we can 
assume that logt = O(n). 

Partition the input strings into equivalence classes C},C2,...,Cq with 
respect to the relation R. By the definition of the polynomial equivalence 
relation, this step can be performed in polynomial time, and q, the number 
of the obtained classes, is bounded by some polynomial p;(n). 

For j = 1,2,...,q, apply the cross-composition A to the strings contained 
in the class Cj, obtaining a parameterized instance (c;,k;) such that (a) 
ky < po(maxzec, |x| + log|C;|), which is polynomially bounded in n and 
logt, and (b) (c;,k;) € Q if and only if there exists at least one instance 
x E€ Cj such that x € L. Consequently, there exists an index i, 1 < i < t, 
such that x; € L if and only if there exists an index j, 1 < j < q, such that 
(cj, kj) E Q. Since logt = O(n), we infer that for some polynomial po(-) it 
holds that k; < po(n) for j = 1,2,...,¢. 

Now apply the assumed compression algorithm C to each instance (c;,k;), 
obtaining a string d; such that d; € R if and only if (cj, kj) E€ Q. Since k; < 
p2(n) and |d;| < p3(k;) for some polynomial p3(-), we have that |d;| < pa(n) 
for some polynomial pa(-). 

Finally, merge all the strings dj into one instance d = d,##do#... #dq- 
From the construction it immediately follows that d € OR(R) if and only if 
there exists an index i, 1 < i < t, such that 2; € L. Hence, condition (b 
of OR-distillation is satisfied. For condition (a), we have that q < pi(n) and 
|d;| < pa(n). Consequently |d| < pi(n) - (pa(n) + 1) — 1, and condition (a 
follows. 
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As an immediate corollary of the described cross-composition of HAMIL- 
TONIAN PATH into LONGEST PATH, the NP-hardness of the HAMILTONIAN 
PATH problem, and Theorem |15.9| we obtain the following result. 


Corollary 15.10. LONGEST PATH does not admit a polynomial kernel unless 
NP C coNP/ poly. 


We remark that it is very rare for a problem to admit such a simple cross- 
composition as in the case of LONGEST PATH. Designing a cross-composition 
usually requires a very good understanding of the combinatorics of the prob- 
lem together with a few ingenious insights into how the construction can be 
performed. One often needs to make a clever choice of the starting language 
L, and/or do a lot of technical work. In Section [15.2] we shall present in a 
number of examples what kind of strategies can be employed for designing a 
cross-composition. 

It is somewhat interesting that the composition framework, which is basi- 
cally the only tool that we have so far for proving kernelization lower bounds, 
seems to be unable to distinguish polynomial kernelization from the weaker 
notion of polynomial compression. It remains open whether there are natural 
problems that do admit polynomial compression algorithms, but for which 
the existence of a polynomial kernel can be refuted. 


15.1.3 AND-distillations and AND-compositions 


In the previous sections we have been always using the OR function as the 
template of behavior of distillations and compositions. It is very tempting 
to conjecture that the same results can be obtained when we substitute OR 
with AND. We could in the same manner define AND-distillation by replacing 
condition (b) of OR-distillation with a requirement that the output string y 
belongs to R if and only if allthe input strings x; belong to L. Likewise, AND- 
cross-composition is defined by replacing condition (b) of cross-composition 
by requiring that the resulting instance (y, k) belongs to Q if and only if all 
the input strings x; belong to L. 

The proof of Theorem [15.9] is oblivious to whether we use the OR func- 
tion or the AND function; the outcome of the constructed distillation will be 
just an instance of AND(R) instead of OR(R). Our arguments behind The- 
orem however, completely break when translating them to the AND 
setting. It turns out that Theorem is still true when we replace OR- 
distillations with AND-distillations, but the proof is much more difficult, and 
hence omitted in this book. 


Theorem 15.11 ([160]). Let L,R C X* be two languages. If there exists 
an AND-distillation of L into R, then L € coNP/ poly. 
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As we have already mentioned, the proof of Theorem can be trans- 
lated to the AND setting in a straightforward manner; we leave checking this 
fact as an easy exercise. Therefore, by replacing usage of Theorem [15.3] with 
Theorem [15.11] we obtain the following result. 


Theorem 15.12. Assume that an NP-hard language L AND-cross-composes 
into a parameterized language Q. Then Q does not admit a polynomial com- 
pression, unless NP C coNP/ poly. 


As an example of a problem that admits a trivial AND-cross-composition 
take the TREEWIDTH problem: given a graph G and parameter k, verify 
whether tw(G) < k. Since treewidth of a disjoint union of a family of 
graphs is equal to the maximum over treewidths of these graphs, the disjoint 
union yields an AND-cross-composition from the unparameterized version of 
TREEWIDTH into the parameterized one. Computing treewidth of a graph is 
NP-hard, so by Theorem [15.12] we infer that TREEWIDTH does not admit a 
polynomial kernel, unless NP C coNP/ poly. The same reasoning can be per- 
formed for other graph parameters that behave similarly under the disjoint 
union, for instance for pathwidth or rankwidth. 

It is noteworthy that problems known to admit AND-cross-compositions 
are much scarcer than those amenable to OR-cross-compositions. We hardly 
know any natural examples other than the aforementioned problems of com- 
puting graph parameters closed under taking the disjoint union. Therefore, 
when trying to refute the existence of a polynomial kernel for a particular 
problem, it is more sensible to first try to design an OR-cross-composition. 


15.2 Examples 


We now show four selected proofs of kernelization lower bounds, chosen to 
present different aspects of the compositions framework. We discuss instance 
selectors, a common theme in many compositions appearing in the litera- 
ture. We also present the so-called polynomial parameter transformations, 
which serve the role of hardness reductions in the world of kernelization 
lower bounds. We conclude with an example of a problem parameterized by 
some structural measure of the input graph and see that the composition 
framework works well also in such cases. 


15.2.1 Instance selector: SET SPLITTING 


Recall that, when designing a cross-composition, we are to encode a logi- 
cal OR of multiple instances £1, £2,..., £+ of the input language as a single 
parameterized instance (y,k) of the output language. Clearly, we need to 
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somehow include all input instances in (y, k), but we need also to encode the 
OR. behavior. 


A natural approach would be to design some gadget that “chooses” 
which instance x; we are going to actually solve. A bit more precisely, 
such an instance selector I’ should interact with a potential solution 
to (y, k) in t different ways, called states. In one direction, if we choose 
state 2 on I’ and solve the instance x;, we should be able to obtain a 
valid solution to (y, k). In the second direction, a valid solution to (y, k) 
that sets I’ in state ¿i should yield a solution to 2;. 

In the design of instance selectors we will often heavily rely on the 
fact that the parameter k of the output instance (y,k) may depend 
polylogarithmically on the number of input instances. In particular, the 
state i of the instance selector will often reflect the binary representation 
of the integer i. 


We now illustrate the concept of an instance selector with a simple example 
of the SET SPLITTING problem. Here, we are given a universe U and a family 
F of subsets of U; the goal is to choose a subset X C U that splits each set 
in F. We say that a set X splits a set A if both AN X £ Q and A\ X #0. We 
are interested in the parameterization k = |U|. We remark that this problem 
has a trivial brute-force FPT algorithm running in time O*(2*). 

It is not hard to prove that SET SPLITTING is NP-hard by a direct re- 
duction from the satisfiability of CNF formulas. In this section we prove the 
following. 


Theorem 15.13. There exists a cross-composition from SET SPLITTING into 
itself, parameterized by the size of the universe. Consequently, SET SPLITTING 
parameterized by |U| does not admit a polynomial compression, unless NP C 
coNP/ poly. 


Before we dive into the composition, let us comment what Theorem 
actually means: There are instances of SET SPLITTING where the family F 
is much larger than the universe U, and we do not expect that any effi- 
cient (polynomial-time) algorithm is able to sparsify such a family F without 
changing the answer in some cases. 


Proof (Proof of Theorem (15.13). By choosing an appropriate polynomial 
equivalence relation R, we may assume that we are given a family of t SET 
SPLITTING instances (F;, U;)tZ4 where |U;| = k for each i. More precisely, 
relation R considers as equivalent all the malformed instances, and all the 
well-formed instances with the same cardinality of the universe. Let us ar- 
bitrarily identify all sets U;; henceforth we assume that all input instances 
operate on the same universe U of size k. 

Note that we have numbered the input instances starting from 0: in this 
way it is more convenient to work with the binary representations of indices 7. 
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instance selector universe U 
1 + logt pairs of vertices 








Fig. 15.3: Part of the construction in the proof of Theorem |15.13] for s = 
logt = 4. The depicted sets are A? (elements in circles) and A! (elements in 
rectangles) for some 4-element set A € F; and i = 5 = 01012. 


For the same reason, we duplicate some input instances so that t = 2° for 
some integer s. Note that this step at most doubles the number of input 
instances. 

We now proceed to the construction of an instance selector. The selector 
should have t states; if it attains state i, it should make all input instances 
‘void’ except for the i-th one. We create 2(s + 1) new elements I = {df 
0<a<s,6 € {0,1}} and define Û = UUT. The new elements are equipped 
with a family F? = {{d°,d4} : 0 < a < s} of sets that force us to choose 
exactly one element from each pair {d®,d}} into the solution X. Intuitively, 
the choice of which element to include into the solution X fr 1<a< s 
corresponds to the choice of the input instance (F;,U); the additional pair 
for a = 0 is needed to break the symmetry. 

We are now ready to formally define the family F for the output instance. 
We start with Ê = FT and then, for each input instance (F;,U), we proceed 
as follows. Let i = b,b2...b, be the binary representation of the index i, 
ba € {0,1} for 1 < a < s; note that we add leading zeroes so that the length 
of the representation is exactly s. Define bọ = 0. For TE A € F; we insert 
the following two sets into the family Ê (see also Fig. |1 


A? = AU{de :0<a<s} and A'=AU {di : 0<a< s}. 


We output the instance (F, Û). Note that |U| = |U|+2(s+1) = k+2(logt+1) 
and, moreover, the construction can be performed in polynomial time. Thus, 
to finish the proof we need to show that (Ê, U) is a yes-instance of SET 
SPLITTING if and only if one of the instances (F;,U) is. 

In one direction, let X be a solution to the instance (F;, U) for some 
0<i<t. Moreover, let bjb2...b, be the binary representation of the index 
i. Define bọ = 0 and consider the set X defined as 


X=Xu{d : 0<a<s}. 
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Clearly, x splits each set in F7, as for each a, 0 < a < s, it holds that 
exactly one of elements d° and d} belongs to X and exactly one does not. 
The assumption that X is a solution to (F;,U) implies that X splits also 
each set A € F;, so in particular it splits A? and A!. Finally, take any j 4 i 
and any A € Fj. Since j ¥ i, indices i and j differ on at least one binary 
position, say a. Now observe that dj € X NA? and di-b € A°\ X, and 
d} € A! \ X and de € ALIN È. Ca x splits both A? and Al, 

In the other direction, let X be a solution to (F, Û). As U\ Xi is a solution 
as well, without loss of generality we may assume that d8 € X. Note that 
since X splits each set of F”, for each 0 < a < s exactly one of the elements 
d? and d} belongs to X and one does not. Let then by € {0,1} be an index 
such that d?e € X and di-b« ¢ X, for 0 < a < s. Note that bọ = 0. 

Let 0 < i < t be such that 6;b2...6, is the binary representation of the 
index i. We claim that X = ẸNU is a solution to the input instance (Fi; UJ: 
Consider any A € F;. Observe that Xnr=A nr and, since X splits 
A°, we have A\ X = A°\ X £ Ø. Moreover, N A! NT = @ and, since Ñ 
splits Al, we have AN X = A! N X #0. Hence X splits A and the lemma is 
proved. 














15.2.2 Polynomial parameter transformations: 
COLORFUL GRAPH MOTIF and STEINER TREE 


In the classic theory of NP-hardness we very rarely show that a particular 
problem L is NP-hard using the definition. Instead of proving from scratch 
that every problem in NP is polynomial-time reducible to L, we can ben- 
efit from the work of Cook and Levin who proved this for satisfiability of 
propositional formulas, and just construct a polynomial-time reduction from 
a problem that is already known to be NP-hard. Thus we can focus on one 
well-chosen hard problem as the source language of the reduction, rather than 
on an arbitrary problem in NP. 

In the world of kernel lower bounds, the situation is somewhat similar: 
an explicit lower bound via the composition framework does not seem as 
challenging as re-proving the Cook-Levin theorem (since we in fact reuse the 
result of Fortnow and Santhanam), but in many cases it is easier to ‘transfer’ 
hardness from one problem to another. To be able to do so, we need an 
appropriate notion of reduction. 


Definition 15.14. Let P,Q C X* x N be two parameterized problems. An 
algorithm A is called a polynomial parameter transformation (PPT, for short) 
from P to Q if, given an instance (x, k) of problem P, A works in polynomial 
time and outputs an equivalent, instance (ĉ, k) of problem Q, i.e., (x, k) € P 
if and only if (ĉ, Å) € Q, such that k < p(k) for some polynomial p(-). 
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Let us verify that the reduction defined as above indeed serves our needs. 
Assume that, apart from a PPT A from P to Q, there exists a compression 
algorithm 6 for problem Q that, given an instance (4, k), in polynomial time 
outputs an instance y of some problem R of size bounded by pp(k) for some 
polynomial pg(-). Consider pipelining algorithms A and B: given an instance 
(x,k) of P, we first use algorithm A to reduce it to an equivalent instance 
(2, k) of Q, and then use algorithm B to compress it to an equivalent in- 
stance y of R. Observe that |y| < pg(k) and k < p(k), hence |y| is bounded 
polynomially in k. Since all computations take polynomial time, we obtain 
a polynomial compression of P into R. This reasoning allows us to state the 
following. 


Theorem 15.15. Let P,Q C X* x N be two parameterized problems and 
assume there exists a polynomial parameter transformation from P to Q. 
Then, if P does not admit a polynomial compression, neither does Q. 


Hence, if we want to show kernelization hardness for problem Q, it suffices 
to ‘transfer’ the hardness from some other problem P, for which we already 
know that (probably) it does not admit a polynomial compression. We remark 
that Theorem [15.15]uses the notion of compression instead of kernelization: 
if we liked to prove an analogous statement for polynomial kernelization, we 
would need to provide a way to reduce ‘back’ Q to P (in an unparameterized 
way). This, in turn, requires some additional assumptions, for example NP- 
hardness of P and Q € NP. Luckily, the composition framework refutes 
the existence of not only polynomial kernelization, but also any polynomial 
compression, and we can use the (simpler) statement of Theorem [15.15] 

We illustrate the use of PPTs on the example of the STEINER TREE prob- 
lem. Here, given a graph G with some terminals K C V(G) and an integer 
£, one asks for a connected subgraph of G that both contains all terminals 
and has at most ¢ edges. Note that without loss of generality we may assume 
that the subgraph in question is a tree, as we care only about connectivity. 
We are interested in the parameterization by @. 

A direct composition for STEINER TREE may be quite involved and dif- 
ficult. However, we make our life much easier by choosing an appropriate 
auxiliary problem: for this problem a composition algorithm is almost trivial, 
and a PPT reduction to STEINER TREE is also relatively simple. 

The auxiliary problem that will be of our interest is called COLORFUL 
GRAPH MOTIF. In this problem we are given a graph G, an integer k, and a 
coloring c of vertices of G into k colors (formally, any function c : V(G) > 
[k]). The task is to find a connected subgraph H of G that contains exactly 
one vertex of each color (and hence has exactly k vertices). The COLORFUL 
GRAPH MOTIF problem is NP-hard even if we restrict the input graphs to 
being trees of maximum degree 3. 

Observe now that a disjoint union yields a composition algorithm for 
COLORFUL GRAPH MOTIF. More precisely, given any number of COLORFUL 
GRAPH MOTIF instances (Gj, k;,c;)!_, with the same number of colors k = 
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Fig. 15.4: Reduction from COLORFUL GRAPH MOTIF to STEINER TREE. 
Black vertices are terminals. The shape of a non-terminal vertex corresponds 
to its color in the input COLORFUL GRAPH MOTIF instance. A solution 
to COLORFUL GRAPH MOTIF with additional edges for the corresponding 
solution to STEINER TREE is highlighted with thick edges and grey nodes 


ki, the connectivity requirement ensures that an instance (Wi, Gi,k, Ui Ci) 
is a yes-instance of COLORFUL GRAPH MOTIF if and only if one of the in- 
stances (G4, ki, ci) is; here, Wi, G; denotes the disjoint union of graphs Gi. 
Hence, using the aforementioned NP-hardness of COLORFUL GRAPH MOTIF 
we may state the following. 


Theorem 15.16. COLORFUL GRAPH MOTIF cross-composes into itself pa- 
rameterized by k, the number of colors. Consequently, COLORFUL GRAPH 
MOTIF parameterized by k does not admit a polynomial compression, unless 
NP C coNP/ poly. 


Thus, we have obtained a hardness result for COLORFUL GRAPH MOTIF 
almost effortlessly. It remains to ‘transfer’ this result to STEINER TREE. 


Theorem 15.17. There exists a PPT from COLORFUL GRAPH MOTIF pa- 
rameterized by the number of colors to STEINER TREE parameterized by the 
size of the tree. Consequently, STEINER TREE parameterized by the size of 
the tree does not admit a polynomial compression, unless NP C coNP/ poly. 


Proof. Let (G,k,c) be a COLORFUL GRAPH MOTIF instance. Without loss 
of generality assume that k > 1, since otherwise we are dealing with a trivial 
yes-instance. We construct a graph G from G by creating k new terminals 
K = {t,,t2,...,t,}, one for each color, and making each terminal t; adjacent 
to c-+(i), that is, to all vertices of color i; see Fig. Let Ê= 2k — 1. We 
claim that (Ĝ, K,?) is a yes-instance of STEINER TREE if and only if (G, k, c) 
is a yes-instance of COLORFUL GRAPH MOTIF. 

In one direction, let H be a solution of the COLORFUL GRAPH MOTIF 
instance (G,k,c). Without loss of generality assume that H is a tree; hence 
|\V(H)| = k and |E(H)| = k — 1. Let v; € V(H) be the vertex of color i in 
the subgraph H. Observe that H=Hu {viti : 1<i< k} is a connected 
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subgraph of Ĝ, containing all terminals t; and having exactly 2k — 1 = é 
edges. 

In the other direction, let H be a solution of the STEINER TREE instance 
(G,K,0). As H is connected and has at most 2k — 1 edges, it contains at 
most k non-terminal vertices. Recall that we assumed k > 1. Since t; € 
V(H) for any 1 < i < k, H needs to contain a neighbor v; of t; in G. By 
construction, v; € V(G) and c(v;) = i; in particular, all the vertices v; are 
pairwise distinct. However, |V(H)| < Ê+ 1 = 2k, hence the vertex set of H i is 
exactly KU{v;, : 1<i< kh In particular, each vertex t; is of degree 1 in H 
and, consequently, H = H \ K is connected. Observe that H is a subgraph 
of G, and contains exactly one vertex v; of each color i. Hence, (G,k,c) is a 
yes-instance of COLORFUL GRAPH MOTIF, and the claim is proved. 

Finally, observe that the aforementioned construction can be performed in 
polynomial time, and Ê = 2k — 1 is bounded polynomially in k. Hence, the 
presented construction is indeed a PPT from COLORFUL GRAPH MOTIF to 
STEINER TREE. 














We remark that there are also other natural parameterizations of STEINER 
TREE: we can parameterize by |K], the number of terminals, or by 2+1—|K], 
the number of non-terminal vertices of the tree in question. While the latter 
parameterization yields a W[2|-hard problem, recall that for the parameter- 
ization by |K| nu exists an FPT algorithm with running time O*(2!*)), 
see Theorem |10.6| Since |K| < £+ 1, in Theorem [15.17] we have considered 
a weaker oarearaterization than by |K |, and so ur [L517] refutes also 
the existence of a polynomial compression for the parameterization by |K], 
under NP É coNP/ poly. 


15.2.3 A more involved one: SET COVER 


In the case of SET SPLITTING the construction of the instance selector was 
somehow natural. Here we present a more involved reduction that shows a 
lower bound for a very important problem, namely SET COVER. An input 
to SET COVER consists of a universe U, a family F of subsets of U, and an 
integer £. The goal is to choose at most £ sets from the family F that cover 
the entire universe U. That is, we are looking for a subfamily ¥ C F such 
that |X| < £ and UX =U. 

It is more convenient for us to work with a graph-theoretic formulation 
of SET COVER, namely the RED-BLUE DOMINATING SET problem (RBDS 
for short): Given a bipartite graph G with sides of the bipartition R and B 
(henceforth called red and blue vertices, respectively) and an integer @, find 
a set X C R of at most £ vertices that together dominate the entire set B, 
i.e., N(X) = B. To see that these problems are equivalent, note that given a 
SET COVER instance (U, F, £), one may set R = F, B = U, and make each 
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set A € F adjacent to all its elements; the condition |J ¥ = U translates to 
N(X) =U in the constructed graph. 

The goal of this section is to show a kernelization lower bound for SET 
COVER, parameterized by the size of the universe, |U|. Recall that for this pa- 
rameterization we have a simple FPT algorithm running in time O* (2'!) that 
performs dynamic programming on subsets of U, see Theorem |6.1} Equiva- 
lently, we are interested in RBDS, parameterized by |B]. 


Theorem 15.18. SET COVER, parameterized by the size of the universe, 
does not admit a polynomial compression unless NP C coNP/ poly. 


Our first step is to move to a variant of this problem, called COLORED 
RED-BLUE DOMINATING SET (COL-RBDS for short), where the red side 
R is partitioned into £ sets, R = Rt U R?U...U Rf, and the solution set 
X is required to contain exactly one vertex from each set Rt. We will often 
think of sets Rt, R?,...,R* as of colors, and we are looking for a colorful 
solution in a sense that the solution needs to pick exactly one vertex of each 
color. We parameterize COL-RBDS by |B| + £, that is, we add the number 
of colors to the parameter. Observe that RBDS becomes polynomial for 
£ > |B| (each vertex w € B may simply greedily pick one ‘private’ neighbor 
in R), so in this problem the parameterization by |B| + £ is equivalent to the 
parameterization by |B|. However, this is not the case for CoL-RBDS, and 
therefore we explicitly add £ to the parameter. 

We now formally show that RBDS and CoL-RBDS are equivalent. 


Lemma 15.19. RBDS, parameterized by |B|, and COL-RBDS, parameter- 
ized by |B| + £, are equivalent with respect to polynomial parameter transfor- 
mations. That is, there exists a PPT from one problem to the other, and vice 
versa. 


Proof. In one direction, observe that, given a RBDS instance (G, £), we may 
replace each vertex v € R with its £ copies {vf : 1 < j < 4} (keeping their 
neighborhood in B) and define Ri = {vf : v € R} for 1 < j < £. This 
gives a PPT from RBDS to CoL-RBDS with considered parameterizations, 
provided that < |B|. Otherwise, as discussed before, the input RBDS 
instance is polynomial-time solvable. 

The second direction is a bit more involved. Let (Ĝ, £) be an instance of 
CoL-RBDS. Create a graph G from G by adding £ new vertices w!, w?,..., wf 
to the blue side B, and making each wÍ adjacent to all the vertices of R’, 
for 1 < j < 4. We claim that (G, 2) is an equivalent instance of RBDS; as 
we parameterize COL-RBDS by |B|+ 2, this construction will yield a desired 
PPT from CoL-RBDS to RBDS. To see the claim, note that to dominate 
each vertex w/ it is sufficient and necessary to take a vertex of RÍ into the 
constructed solution X. Therefore, any solution of the RBDS instance (G, £) 
needs to contain at least one vertex from each set RÍ, and since it contains 
at most £ vertices in total, it must contain exactly one vertex from each set 
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Fig. 15.5: Gadget in the cross-composition for COL-RBDS with £ = 5 col- 
ors. The vertices from Ri are adjacent to the circle-shaped vertices, and the 
vertices from Rĝ to the square-shaped ones 


Ri. Consequently, a set X C V(G) of size at most £ is a solution of the CoL- 
RBDS instance (G, £) if and only if it is a solution of the RBDS instance 
(G, ¢), and we are done. 














Lemma [15.19] allows us to focus on COL-RBDS. The enhanced structure 
of this problem allows us to design a cross-composition, concluding the proof 
of Theorem [15.18 


Lemma 15.20. CoL-RBDS cross-composes into itself, parameterized by |B\+ 
L. 


Proof. By choosing an appropriate polynomial equivalence relation, we may 
assume that we are dealing with a sequence (Gj, ¢;)'25 of CoL-RBDS in- 
stances with an equal number of blue vertices, denoted by n, and equal bud- 
get £ = 4. For a graph G;, we denote the sides of its bipartition as B; and 
R;, and let R; = R! UR?U...U RẸ be the partition of the set of red vertices 
into £ colors. We remark that we again number the input instances starting 
from 0 in order to make use of the binary representation of the index i. 

Our first step is to construct a graph H as follows. We first take a disjoint 
union of the graphs G;, and then we arbitrarily identify all blue sides B; into 
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one set B of size n, which constitutes the blue side of H. The red side of 
H is defined as R = We R;, and partitioned into colors RÍ = We R, for 
1 < j < £. We remark that the CoL-RBDS instance (H, £) is not necessarily 
equivalent to the logical OR, of all input instances (G4, 4;): a solution to 
(H,£) may contain vertices that originate from different input instances G;. 
We now design a gadget — an instance selector in fact — that will prohibit 
such behavior. 

Let s be the smallest integer such that t < 25; note that s = O(log t). Let 
T ={df : 1<a< s, ße {0,1}}. For each 1 <i < t, let bib}...b¢ be the 
binary representation of the index i, and let y(i) = {a$ : L<a<_s}and 
TO =P \ Y). 

We define 

Ê = BU (T x {2,3,...,2}). 


That is, we make £ — 1 copies of the set I’, indexed by integers 2,3,...,£, 
and add them to the blue side of the graph H. Finally, for each 1 < i < t we 
make each vertex of R} adjacent to all the vertices of y(i) x {2,3,..., 4} and 
for each 2 < j < £ we make each vertex of Rİ adjacent to all the vertices of 
(i) x {7}; see Fig. Let us denote the resulting graph by G. We claim 
that the CoL-RBDS instance (G, £) is a yes-instance if and only if one of the 
instances (G;,¢;) is a yes-instance. As the construction can be performed in 
polynomial time, and |B| = |B| + |'|(€— 1) = O((|B| + £ logt), this claim 
would conclude the proof of the lemma. 


Before we start the formal proof, let us give some intuition. To solve 
(G, £), we need to pick a vertex x/ from each color Ri. We would like 
to ensure that whenever x! € R;, then xf € R; as well for all j. This is 
ensured by the blue vertices in I x {j}: only xt and 2? may dominate 
them, and each such vertex dominates only s = |I'|/2 vertices from 
I x {j}. Hence, to dominate the entire I’ x {j}, we need to pick a 
vertex xÍ that dominates the complement of the neighborhood of zt. 
This, in turn, is only satisfied if both x! and x originate from the same 
input instance. 


We now proceed to the formal argumentation. 

In one direction, let X = {x',a?,...,2°} be a solution to an instance 
(Gi, li), where x3 € RÍ for 1 < j < £. We claim that X is a solution to (G, £) 
as well. Clearly, it contains exactly one vertex in each set RI and B C Na(X). 
It remains to show that X dominates also the new vertices of B. To this end, 
consider an index j, where 2 < j < 4. By construction, x! is adjacent to 
y(i) x {j}, whereas xf is adjacent to F(i) x {j}. Hence, I x {j} C Ne(X). As 
the choice of j was arbitrary, we have that I x {2,3,...,@} C Ne(X), and 
X is a solution to (G, 2). 
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In the second direction, let X = {x!,x?,...,a°} be a solution in the in- 
stance (G, 0). We claim that there exists 1 < i < t such that X C R;; this 
would imply that X is a solution to (Gj, @;) as well, finishing the proof of the 
equivalence. Assume that x! € R; and x) € Ry for some 1 < j < £. Consider 
the vertices of I x {j}. By the construction, the neighbors of x! in this set 
are y(i) x {j}, the neighbors of xf in this set are 7(i’) x {j}, and no other 
vertex of X has a neighbor in I’ x {j}. Since |y(2)| = |¥(2’)| = s = |I'|/2 and 
y(t) U F(i') = T, we infer that y(i) = y(i’) and, consequently, i = 7’. This 
finishes the proof of the claim, of Lemma[15.20|and of Theorem [15.18] 














We remark here that there are other natural parameters of SET COVER 
to consider: we may parameterize it by £, or by |F| (which in the language 
of RBDS means parameterization by |R]). It is easy to see that the first pa- 
rameterization generalizes DOMINATING SET parameterized by the solution 
size, and hence is W[2]-hard. For parameterization by |F| we have a triv- 
ial FPT algorithm working in O*(2!7!) time that checks all subsets of F. It 
can be proved that also for this parameterization the existence of a polyno- 
mial kernel may be refuted under the assumption of NP É coNP/ poly; see 
Exercise [15.4 


15.2.4 Structural parameters: CLIQUE parameterized by 
the vertex cover number 


Our last example concerns the so-called structural parameters: instead of 
considering a natural parameter such as solution size, we parameterize by 
some structural measure of the input graph, such as treewidth or cliquewidth. 
In this section we focus on the classic CLIQUE problem, parameterized by the 
vertex cover number of the input graph. 

More formally, we assume that an input consists of a graph G, and integer 
£ and a vertex cover Z C V(G) of the graph G. The task is to check whether 
G contains a complete subgraph on £ vertices. We parameterize by |Z], the 
size of the vertex cover given as an input. 

One may wonder why we assume a vertex cover is given on the input: 
a second option would be to consider a variant where no vertex cover is 
present, and we only (theoretically) analyze the algorithm in terms of the 
size of minimum vertex cover of the input graph. However, as the minimum 
vertex cover problem admits an efficient 2-approximation algorithm, from the 
point of view of parameterized algorithms and polynomial kernelization these 
variants are equivalent: we may always compute an approximate vertex cover 
and use it as the input set Z. 

We remark also that CLIQUE, parameterized by the vertex cover number, 
is trivially fixed-parameter tractable: any clique in G may contain at most one 
vertex outside Z, so a brute-force algorithm may consider only 2!4!(|V(G)| — 
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Fig. 15.6: An illustration of the cross-composition for CLIQUE parameterized 
by the vertex cover number. The vertex fẹ for u = v2 is adjacent to all 
vertices in P except for the column with subscript u 


|Z| +1) possible solutions. Recall that, in contrast, the standard solution-size 
parameterization of CLIQUE yields a W[1]-hard problem. 

Our goal in this section is to show that our structural parameterization 
of CLIQUE does not admit a polynomial compression. That is, we show the 
following. 


Theorem 15.21. CLIQUE cross-composes into itself, parameterized by the 
vertex cover number. Consequently, CLIQUE parameterized by the vertex cover 
number does not admit a polynomial compression, unless NP C coNP/ poly. 


Proof. By choosing an appropriate polynomial equivalence relation, we may 
assume that we are given a sequence (Gi, ¢;)!_, of (unparameterized) CLIQUE 
instances, where n = |V(G;)| and £ = £; for all i. Without loss of generality 
assume £ < n. Let us arbitrarily identify the vertex sets of the input graphs, 
so henceforth we assume that V(G;) = V for each i, 1 < i < t. We are going 
to construct a clique instance (G,/) with vertex cover Ê of size bounded 
polynomially in n. 

The graph G will consist of two parts: one small central part, where the 
clique in question is chosen, and a large scattered part, where we choose 
in which input instance the clique resides. The scattered part will be an 
independent set: this ensures that the small central part is actually a vertex 
cover of G. 

For the scattered part, we construct simply a vertex g; for each input 
graph G;. As promised, these vertices are pairwise nonadjacent. The choice 
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of which vertex g; we include in the solution to (G, Â determines the choice 
of the instance where we actually exhibit a solution. 

For the central part, we start with creating nf vertices P = {pf : v€ 
V,1 < a < £}. We make p% adjacent to p? if and only if v 4 u and a Æ b. Note 
that this ensures that every maximum clique in Ĝ[P] has £ vertices: it contains 
exactly one vertex p% for each index a, and at most one vertex p% for each 
index v. We also create 3(5) vertices F = {f2}, fort, fix? : uve S) and 
make every two of them adjacent if they have different subscripts. Moreover, 
the vertex f2!! is adjacent to all vertices of P, whereas vertex f} is adjacent 


Uv UU 
to P\ {pf : l1<a< ¢} and fig is adjacent to P \{p% : 1<a< l}; see 
Fig 15.6] 

For each input graph i, 1 <i < t, we make g; adjacent to all the vertices 
of P. Then, for each uv € (3), we make g; adjacent to fa! if uv € E(G,), 
and to both fp“ and fpe otherwise. Finally, we set l = 1 + £+ (5). Clearly, 
the construction can be performed in polynomial time. Moreover, P U F is 
a vertex cover of G of size nl + 3(5) = O(n”). To finish the proof of the 
theorem it suffices to show that there exists an @vertex clique in G if and 
only if one of the input graphs G; contains an ¢-vertex clique. 


To give some intuition, let us first explore how an “vertex clique may 
appear in graph G. Such a clique X may contain only one vertex g;, at 
most l vertices of P (one for each index a) and at most (5) vertices of 


F (one for each wv € ans However, Ê = 1 +/+ (Os hence, X contains 
exactly one vertex gi, exactly one vertex p% for each 1 < a < £, and 
exactly one vertex among vertices fêl, f7“ or fz” for each uv € (Oye 
The choice of g; determines the instance we are actually solving, the 
choice of the vertices of P encodes a supposed clique in G;, whereas the 
vertices in F verify that the chosen vertices of P indeed correspond to 


a clique in Gj. 


Let us now proceed to the formal argumentation. 

Assume first that X = {x',2?,...,a°} induces an ¢-vertex clique in some 
input graph G;. Construct X as follows: first, take X = {gi }U{ pa : l<a< 
£} and then, for each uv € (5), add fè! to X if uv € E(G;), add fz" to X if 
uv ¢ E(G;) and u ¢ X and add f7? to X if uv ¢ E(G;), u€ X and v ¢ X. 
Note that the fact that X induces a clique implies that it is not possible 
that uv ¢ E(G;) and u,v € X and hence the presented case distinction is 
exhaustive. It is straightforward to verify that X induces a clique in G of size 
g=1+2+ (§). 

In the other direction, let X induce an Ēvertex clique in G. As discussed, 
X needs to contain exactly one vertex gi, exactly one of the vertices f2!, fr“ 
or fay for each wv € (5), and exactly one vertex p%. for each 1 < a < £, where 
the vertices x° are pairwise different. We claim that X = {x° : 1<a< ¢} 
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is an ¢-vertex clique in G;. Clearly |X| = £, so it remains to prove that G;[X] 
is a clique. 

By contrary, assume that xx? ¢ E(G;) for some 1 < a,b < £, a Æ b. By 
the construction, g; is not adjacent to fè! , and, consequently, either 


xax? 
b 
72 
OL Jagd 


a 
xox? 


belongs to Š. However, the first one is not adjacent to p%a E€ X, and 
the latter is not adjacent to pts € X. This contradicts the assumption that 
X induces a clique in G, and concludes the proof of Theorem |15.21 
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The tools that we have seen so far are only able to distinguish problems having 
polynomial kernels from those that do not admit such preprocessing routines. 
However, is it possible to prove more refined lower bounds for problems which 
actually do possess polynomial kernels? For example, in Section [2.5] we have 
learned a basic kernelization algorithm for VERTEX COVER that returns an 
instance with at most 2k vertices, which therefore can be encoded using 
O(k?) bits. At first glance, it seems implausible to obtain a kernel that can 
be encoded in a subquadratic number of bits. Can we refine our methodology 
to develop techniques for proving lower bounds of such type? 

The answer to this question is affirmative, and comes via the framework of 
so-called weak compositions. Again, the intuition is very simple. In the pre- 
vious sections we have not used the full power of the proof of Theorem [15.3] 
Intuitively, an OR-distillation algorithm has to lose some information from 
the input instances z1, £2,..., 2, even if the bitsize of the output is bounded 
by p(max!_, |z;|) -t+~© for a polynomial p(-) and any € > 0. And indeed, 
it is easy to verify that our proof of Theorem [15.3]can be adjusted even to 
this weaker notion of OR-distillation. Now assume that somebody gave us a 
version of cross-composition from some NP-hard language L to the VERTEX 
COVER problem, whose output parameter k is allowed to be roughly propor- 
tional to the square root of the number of input instances. More precisely, for 
any 6 > 0 we have some polynomial p(-) such that k < p(maxt_, |x;|) -¢274. 
If the VERTEX COVER problem admitted a compression into bitsize O(k?—©°) 
for some fixed € 9 > 0, then by choosing 6 = €9/4 and pipelinining the com- 
position with the compression, we would obtain a contradiction with the 
strengthened version of Theorem [15.3] 

This motivates the following definition. 


Definition 15.22. Let L C X* be a language and Q C &X* x N be a pa- 
rameterized language. We say that L weakly-cross-composes into Q if there 
exists a real constant d > 1, called the dimension, a polynomial equivalence 
relation R, and an algorithm A, called the weak cross-composition, satisfy- 
ing the following conditions. The algorithm A takes as input a sequence of 
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strings 21, @2,...,24 E X* that are equivalent with respect to R, runs in time 
polynomial in }7}_, |a;|, and outputs one instance (y, k) € X* x N such that: 


(a) for every ô > 0 there exists a polynomial p(-) such that for every choice of 
t and input strings £1, 22,..., £, it holds that k < p(maxt_, |x;|) -taté, 
and 

(b) (y, k) € Q if and only if there exists at least one index i such that z; € L. 


The following theorem easily follows from the described adjustments in 
the proofs of Theorem and Theorem We leave its proof as an easy 
exercise for the reader (Exercises and |15.3). 


Theorem 15.23. Assume that an NP-hard language L admits a weak cross- 
composition of dimension d into a parameterized language Q. Assume further 
that Q admits a polynomial compression with bitsize O(k’—*), for some £ > 0. 
Then NP C coNP/ poly. 


Therefore, to refute the existence of a polynomial compression for VERTEX 
COVER with strictly subquadratic bitsize, it suffices to design a weak cross- 
composition of dimension 2 from some NP-hard problem to VERTEX COVER. 
It appears that we will be able to elegantly streamline the construction by 
choosing the starting problem wisely. The problem of our interest is called 
MULTICOLORED BICLIQUE: We are given a bipartite graph G = (A, B, E) 
such that A is partitioned into k color classes A1, Ao,...,Axz, each of size n, 
and B is partitioned into k color classes B1, Bo,..., By, each of size n. The 
question is whether one can pick one vertex from each class A; and one vertex 
from each class B; so that the graph induced by the chosen vertices is iso- 
morphic to a complete bipartite graph Ky,,,. The MULTICOLORED BICLIQUE 
problem is known to be NP-hard. 


Lemma 15.24. There exists a weak cross-composition of dimension 2 from 
the MULTICOLORED BICLIQUE problem into the VERTEX COVER problem 
parameterized by the solution size. 


Proof. By choosing the polynomial equivalence relation appropriately, we can 
assume that we are given a sequence of well-formed instances I‘, I?,...,I¢ 
having the same values of k and n. More precisely, each J contains a bipartite 
graph GJ = (A’, BJ, EÏ), each A? is partitioned into classes Aj, A3,..., Az, 
each BÍ is partitioned into classes BiB, ia ., Bi, and |A2| = |B? | = n for 
each i = 1,2,...,k and j = 1,2,...,t. By duplicating some instances if 
necessary, we assume that t = s? for some integer s. 

Construct now a graph H as follows. The vertex set of H consists of 2s 
classes of vertices A!, A?,..., A’, B!, B2, ..., ÊS, each of size kn. Each class 
A® is further partitioned into classes At, Â$, ews Ae each of size n, and the 
same is performed for the classes B® as well. The intuition now is that we 
have s? input instances I’, and s? pairs (A, BJ?) that look exactly as the 
sides of the bipartition of the graph GÎ. Hence, we will pack the information 
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Fig. 15.7: The graph H in the proof of Theorem [15.23] constructed for k = 2, 
n = 3, and t = 9; the highlighted part contains the set of edges between 
Â! and Ê? that is constructed in (c) to encode the input graph G*. To 
avoid confusion, the complete bipartite graphs between every pair of classes 
Â! and between every pair of classes Ê! have been depicted symbolically as 


small K2 2s 


about the edges between AJ and BÍ from each input instance J’ into the 
edge space between Âj! and B? for a different pair of indices (41, j2). 

Let us fix an arbitrarily chosen bijection T between [s*] and [s] x [s]. The 
edge set of H is defined as follows: 


(a) For every 1 < 4 < 2 < s, introduce an edge between every vertex of 
Â“ and every vertex of Ae, and an edge between every vertex of Bo 
and every vertex of B®, 

(b) For every 1 < £ < s and every 1 <i < k, turn the sets Af and Êf into 


cliques. 
(c) For every 1 < j < t, let r(j) = (j1, j2). Fix any bijection Êj that, for each 
i = 1,2,...,k, matches vertices of A? with vertices of A?', and vertices 


of BI with vertices of BP. For every u € A! and v € BÏ, put an edge 
between ¢;(u) and ġ;(v) if and only if the edge uv is not present in G°. 


The role of edges introduced in (a) is to enforce that the solution has 
a nontrivial behavior for exactly one of the pairs (Ât, B72), which will 
emulate the choice of the instance to be satisfied. Edges introduced in 
(b) and (c) encode the instance itself. 


Let us fix the budget for the resulting VERTEX COVER instance as k = 
|V(H)| — 2k = 2snk — 2k. Since s = yt, this means that the obtained cross- 
composition will have dimension 2. We are left with formally verifying that 
the obtained instance (H,k) of VERTEX COVER is equivalent to the OR of 
the input instances of MULTICOLORED BICLIQUE. 
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Claim 15.25. If there exists an index j, 1 < j < t, such that IÍ is a yes- 
instance of MULTICOLORED BICLIQUE, then (H, k) is a yes-instance of VER- 
TEX COVER. 


Proof. Let Y’ be the vertex set of a Kp ,-biclique in GJ, where Y/ contains 
exactly one vertex from each color class A A}, sou „Al, Bi, Bi, ses , Bi. Let 
us define X = V (H) \ ¢;(YŻ). Since |Y4| = 2k, we have that |X| = k. We are 
going to show that X is a vertex cover of H, which will certify that (H, k) isa 
yes-instance. We prove an equivalent statement that Y = V(H)\X = ¢;(Y”) 
is an independent set in H. 

This, however, follows directly from the construction of H. Let (j1,j2) = 
T(j) and let us take any distinct u,v € Y. If u,v € Ad, then u and v 
have to belong to two different classes among A??, A3',... pA, and there 
is no edge between any pair of these classes. The same reasoning also holds 
when u,v € B#2. However, if u € A and v € B%, then we have that 
7 '(u)d; (v) € EY, because YI induces a biclique in G). By the construction 


j j 
of H it follows that uv ¢ E(H). 4 


Claim 15.26. If (H, k) is a yes-instance of VERTEX COVER, then there ez- 
ists an indez j, 1 < j < t, such that IY is a yes-instance of MULTICOLORED 
BICLIQUE. 


Proof. Let X be a vertex cover of H of size at most k. Hence, Y = V(H)\ X 
is an independent set in H of size at least 2k. 

We first claim that there are two indices 1 < j1,j2 < s such that Y C 
Âi U Bi. Indeed, if there were two different indices j| and j} such that 
there existed some u € Y N Ati and v € Y N AM, then by the construction 
of H we would have that uv € E(H), which contradicts the fact that Y is 
an independent set. The same reasoning can be performed for the sets Be. 
Let j = 77!(j1,j2) then; since we have that Y C A? U ÊI, we can define 
Y) = 6;'(¥Y). The goal is to show that Y? is a solution to the instance I’. 

We now observe that yj contains at most one vertex from each color class 
Aj, Aj,..., AL, Bi, B3,..., B}. This follows directly from the fact that each 
of the classes A? , AP, Fes <a ; BP ; Be, ee , BP induces a clique in H. Since 
|Y9| = |Y| > 2k and there are 2k color classes, we infer that Y? contains 
exactly one vertex from each color class. 

Finally, pick any u € Y/N. A? and v € Y/N Bi, and observe that since ġ;(u) 
and ¢;(v) are not adjacent in H, the vertices u and v are adjacent in G’. 
Consequently, YÍ induces a K;,,,-biclique in G? that contains one vertex from 
each color class, and J; is hence a yes-instance of MULTICOLORED BICLIQUE. 

a) 











Claims |15.25] and |15.26] conclude the proof of Lemma [15.24 
From Theorem }15.23/and Lemma/|15.24] we can derive the following corollary. 
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Corollary 15.27. For any £ > 0, the VERTEX COVER problem parameter- 
ized by the solution size does not admit a polynomial compression with bitsize 
O(k?-£), unless NP C coNP/ poly. 


Observe now that the lower bounds on the bitsize of a compression can be 
transferred in a similar manner via polynomial parameter transformations 
as we have seen in Section It is just the polynomial dependence of 
the output parameter on the input one that determines the strength of the 
obtained lower bound for the target problem. For instance, there is a simple 
polynomial parameter transformation that takes as input an instance (G, k) 
of VERTEX COVER and outputs an equivalent instance (G’, k) of FEEDBACK 
VERTEX SET: To obtain G”, take every edge uv € E(G) and add a vertex 
Ww that is adjacent only to u and to v, thus creating a triangle that must 
be hit by the solution. By pipelining this reduction with Corollary |15.27| we 
obtain the following result. 


Corollary 15.28. For any £ > 0, the FEEDBACK VERTEX SET problem pa- 


rameterized by the solution size does not admit a polynomial compression with 
bitsize O(k?—*), unless NP C coNP/ poly. 


As with classic cross-compositions, we can refute the possibility that an 
algorithm with a too weak computational power packs the information con- 
tained in the input instance into a too small bitsize of the output instance. 
However, again we have no control over how the information needs to be 
organized in, say, the kernel. For instance, Corollaries [15.27] and [15.28] show 
that obtaining kernels for the VERTEX COVER and FEEDBACK VERTEX SET 
problems with a strictly subquadratic number of edges is implausible (since 
O(k?~©) edges can be encoded in O(k?~* log k) bits). However, for VERTEX 
COVER we know a kernel that has a linear number of vertices, namely 2k, 
whereas no such kernel is known for FEEDBACK VERTEX SET: the best known 
one (given in Chapter [9} has O(k?) vertices. It seems hard to understand, 
using solely weak compositions as a tool, whether obtaining a kernel for 
FEEDBACK VERTEX SET with a subquadratic number of vertices is possible 
or not. 

We state the following theorem for the convenience of the reader, as it 
provides a good set of basic problems for further reductions. In the PERFECT 
d-SET MATCHING problem we are given a universe U, of size divisible by d, 
and a family F C 2U of subsets of U, each of size d. The question is whether 
there exists a subfamily G C F of size |U|/d such that sets in G are pairwise 
disjoint. 

Theorem 15.29 ([129} [130]). Let € > 0 be any constant. Unless NP C 
coNP/ poly, the following statements hold: 


(a) For any q > 3, the q-SAT problem parameterized by the number of vari- 
ables n does not have a compression with bitsize O (n15). 
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(b) For any d > 2, the d-HITTING SET problem parameterized by |U| does 
not have a compression with bitsize O(\U|*-*). 

(c) For any d > 3, the PERFECT d-SET MATCHING problem parameterized 
by |U| does not have a compression with bitsize O(\U|*-*). 


Note that for all the considered problems, already the trivial kernelization 
algorithms, which just remove duplicates of clauses or sets in the input, match 
the presented lower bounds. 

The proofs of the presented statements are more difficult, and hence omit- 
ted from this book. Note that statement (b) for d = 2 is equivalent to Corol- 
lary [15.27] The proof for d > 2 can be obtained using the same strategy, 
or derived by an easy reduction from statement (a). The known proofs of 
statement (a), which was actually the original motivation for introducing the 
weak composition framework, use quite advanced results from number theory. 
The proof of statement (c) is a fairly complicated weak cross-composition; 
however, it is self-contained and does not use any external results. 


Exercises 


15.1 (Z). In a few places (e.g., STEINER TREE parameterized by either £ or |K|) we have 
used the following argument: If a problem P admits two parameterizations kı and k2, and 
kı is always bounded polynomially in k2 in nontrivial instances, then a result that refutes 
the existence of a polynomial compression for the parameterization by k2 also does the 
same for the parameterization by ki. Prove this statement formally, using the notion of 
polynomial parameter transformation. 


15.2. Prove that Theorem holds even if the output string of an OR-distillation is 
allowed to have length at most p(max‘_, |x;|) - t!~*, for some polynomial p(-) and any 
e > 0. Would it be possible to use o(t) instead of t!—*? 


15.3 (Z). Derive Theorem|15.23|from Exercise 


15.4. In this meta-exercise you are asked to prove that the following parameterized prob- 
lems do not admit polynomial compressions unless NP C coNP/ poly. In each case, we 
give problem name, the parameter in parentheses, and the problem definition. For each 
problem, you may assume that it is NP-hard (without proof). 

Some important definitions: We say that a graph G is d-degenerate for some positive 
integer d if each subgraph of G contains a vertex of degree at most d. For example, forests 
are 1-degenerate and planar graphs are 5-degenerate. A bull is a 5-vertex graph H with 
V(H) = {a,b,c,d,e} and E(H) = {ab, bc, ac, bd, ce}. A proper coloring of a graph G with 
q colors is a function c: V(G) —> [q] such that c(w) Æ c(v) whenever uv € E(G). 


1. (Æ) DirecTeD Max Lear (k): Does there exist an outbranching with at least k leaves 
in a given directed graph G? An outbranching is a directed graph that (a) is a tree if 
we suppress the directions of the edges, and (b) is rooted at one vertex, and all edges 
are directed away from the root. 

2. (Z) Max LEAF SuBTREE (k): Does there exist a subtree of a given graph G with at 
least k leaves? 

3. CONNECTED VERTEX Cover (k): Does there exist a set X of at most k vertices in 
the given graph G, such that G[X] is connected and G \ X is edgeless? 
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10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


. CONNECTED FEEDBACK VERTEX SET (k): Does there exist a set X of at most k 


vertices in the given graph G, such that G[X] is connected and G \ X is a forest? 


. CONNECTED BuLL Hittine (k): Does there exist a set of vertices X in the given 


graph G such that G[X] is connected and G\ X does not contain a bull as an induced 
subgraph? 


. 2-DEG CONNECTED FEEDBACK VERTEX SET (k): The same as CONNECTED FEEDBACK 


VERTEX SET, but the input graph is required to be 2-degenerate. 


. 2-DEG STEINER TREE(k): The same as STEINER TREE, but the input graph is required 


to be 2-degenerate. 


. 2-DEG CONNECTED DOMINATING SET(k): Does there exist a set of vertices X in the 


given 2-degenerate graph G such that G[X] is connected and NG[X] = V(G)? 


. SET Packine (|U|): Do there exist k pairwise disjoint sets in a given family F of 


subsets of some universe U? 

SET Cover (|F|): Do there exist £ sets in a given family F of subsets of some universe 
U that together cover the whole U? 

DIRECTED EpGE MuLTIWay CUT WITH 2 TERMINALS(k): Does there exist a set X of 
at most k edges of the given directed graph G with designated terminals s and t, such 
that in G \ X there does not exist a (directed) path from s to t nor from t to s? 
DisJoiIntT Factors (|I|): Do there exist pairwise disjoint subwords w1,u2,...,us of 
the input word w over alphabet I = {71, Y2, ..., Ys} such that each u; is of length at 
least 2 and begins and ends with q;? 

VERTEX DisJoINT Patus (k): Do there exist vertex-disjoint paths P}, P2,..., Pp in 
the input graph G with designated terminal pairs (s1,t1), (s2,t2),.-., (Sk, tk), such 
that P; connects s; with t; for each i? 

(&) Curomatic NuMBER (VC): Does there exist a proper coloring of the input graph 
G with q colors? The input consists of the graph G, the integer q and a set Z C V(G) 
that is a vertex cover of G (i.e., G\ Z is edgeless). 

($) FEEDBACK VERTEX SET (CLUSTER DisTANCE): Does there exist a set X C V(G) 
of size at most £ such that G \ X is a forest? The input consists of the graph G, the 
integer £, and a set Z C V(G) such that G\ Z is a cluster graph (i.e., each connected 
component of G \ Z is a clique). 

(&) Enae Ciiqur Cover (k): Do there exist k subgraphs K1, K2,..., Kp of the 
input graph G such that each K; is a clique and E(G) = GE E(K;)? 

SUBSET Sum (k + logm): Does there exist a subset X of at most k numbers from 
the given set S of positive integers, such that Prex x equals exactly a number m 
prescribed in the input? 

(8) s-Way Cur (k): Does there exist a set X of at most k edges of the input graph 
G, such that G \ X has at least s connected components? Both numbers s and k are 
given as input. 

(&) EULERIAN DELETION (k): Does there exist a set of at most k edges in the input 
directed graph G, such that G \ X is Eulerian (i.e., weakly connected and each vertex 
has equal in- and outdegree)? 


15.5. The Opp CYCLE TRANSVERSAL problem asks for a set X of at most k vertices of 
the input graph G such that G \ X is bipartite. Prove that, unless NP C coNP/ poly, for 
any € > 0 Opp CYCLE TRANSVERSAL does not admit a kernel with O(k?—*) edges. 


15.6. Prove that there exists a function f : Z; —> Zs with f(d) => o as d > oo such 
that if for some d > 2 there exists a kernel of size O(kf(®) for any of the following two 
problems, restricted to d-degenerate graphs, then NP C coNP/ poly: 


1. 
2. 


CONNECTED VERTEX COVER, 
DoMINATING SET. 
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Hints 


Follow the same proof strategy, but modify the choice of t. 
15.4 


Try disjoint union. 

Try disjoint union. 

Reduce from SET Cover parameterized by the size of the universe. 

Reduce from CONNECTED VERTEX COVER. 

[74 [8] Recall that COLORFUL GRAPH MotiF is NP-hard already on trees. Use this result 
to improve the kernelization lower bound for COLORFUL GRAPH MotiF and reduce 
from this problem. 

[9] Adjust the cross-composition for SET Cover parameterized by the size of the universe. 

[10] Define a colorful version of the problem and prove that it is equivalent to the original 
one. Focus on proving hardness for the colorful version. Design an instance selector 
consisting of 2logt additional sets in logt new colors. The chosen sets of these colors 
should cover completely the universes of all the input instances apart from exactly one, 
which should be covered by ‘regular’ sets. 

Compose the input instances ‘sequentially’. However, enrich the input instances be- 
fore performing the composition, in order to make sure that the solution to the com- 
posed instance will not be spread among several of the input instances, but will rather 
be concentrated in one instance. 

Start the composition with instances over the same alphabet I”, and add log t symbols 
to I’ to emulate an instance selector. 

Reduce from DisJoINT FACTORS. 

Carefully prepare your input language for composition. For example, consider the 
task of checking whether a planar graph is properly colorable with four colors, which 
is NP-hard. Note that it is definitely colorable with four colors. Design an instance 
selector that allows all input instances to use four colors, except for one instance that 
may use only three. 

Reduce from a colorful variant of the SET PACKING problem. 

Use an AND-cross-composition rather than an OR-cross-composition. You need a 
careful construction to ensure that all the edges of the resulting instance are covered. 
For this, use the following observation: If the input graph contains a simplicial vertex, 
then we can assume that any solution contains its closed neighborhood as one of the 
cliques. Thus, by adding simplicial vertices we can create cliques of edges that will be 
covered for sure. 

Make a cross-composition from the same problem, but watch out for the bound on 
the number of equivalence classes in the polynomial equivalence relation — you cannot 
directly classify the instances according to their target value m. How to go around this 
problem? Then, design an instance selector to ensure that all integers in the solution 
come from the same input instance. 

First, show that a weighted variant, where each edge has small integer weight, is 
equivalent. Then, cross-compose from CLIQUE to the weighted variant. Your intended 
solution should pick one input instance, and cut out all vertices not contained in the 
supposed clique as isolated vertices, and leave the rest of the graph as one large con- 
nected component. 

Prove NP-hardness and cross-compose the following auxiliary problem: given a di- 
rected graph G with terminals s and t and some forbidden pairs F C Ce, does there 
exist a path from s to t that, for any eje2 € F, does not contain both edges e1 and e2. 


Reduce from VERTEX COVER. 
Reduce from any of the problems considered in Theorem [15.29 
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Bibliographic notes 


For an introduction to complexity theory, and in particular to the concepts used in this 
chapter, we refer the reader to the recent book of Arora and Barak [21]. The fact that 
NP C coNP/poly implies that the polynomial hierarchy collapses to its third level is 
known as Yap’s theorem [440]. 

Theorem [15.3] was proved by Fortnow and Santhanam [213]. The first framework for 
proving kernelization lower bounds that builds upon the work of Fortnow and Santhanam 
was proposed by Bodlaender, Downey, Fellows, and Hermelin [50]. This technique, called 
OR-composition, is a slightly weaker form of the cross-composition framework that was 
presented in this chapter. More precisely, the formalism presented in does not use poly- 
nomial equivalence relations, assumes that the source and target languages of a composition 
are the same, and does not allow the output parameter to depend poly-logarithmically on 
the number of input instances. Later it turned out that the lacking features are helpful 
in streamlining compositions for many problems, and many authors developed ad hoc, 
problem-dependant methods for justifying them. The cross-composition framework, which 
was eventually proposed by Bodlaender, Jansen, and Kratsch [53], elegantly explains why 
these features can be added in general. The fact that the whole approach should work 
when the OR function is replaced with AND was initially conjectured by Bodlaender et 
al. [50]; the statement had been dubbed the AND-conjecture. The conjecture was even- 
tually resolved by Drucker [160], who proved Theorem [15.11] We note that Drucker only 
states that the existence of an AND-distillation for an NP-hard problem implies that 
NP C coNP/ poly, but the slightly stronger statement of Theorem[15.11] also follows from 
his results. A shorter proof of the result of Drucker was recently announced by Dell [128]. 

The idea of an instance selector originates in the work of Dom, Lokshtanov, and 
Saurabh [IZI]. The lower bound for SET SPLITTING was first obtained by Cygan, Pilipczuk, 
Pilipczuk, and Wojtaszczyk via a PPT from SET Cover parameterized by the size of 
the family, i.e., Exercise |15.4]10| The lower bound for STEINER TREE was first proved by 
Dom, Lokshtanov, and Saurabh [II], again via a PPT from SeT Cover parameterized 
by the size of the family. The proof using GRAPH MotiF as a pivot problem was proposed 
by Cygan, Pilipczuk, Pilipczuk, and Wojtaszczyk [121], and the main point was that the 
resulting graph is 2-degenerate. The example of SET Cover parameterized by the size of 
the universe has been taken from Dom, Lokshtanov, and Saurabh [141]; we note that the 
original composition was slightly more complicated. CLIQUE parameterized by vertex cover 
was one of the first examples of the application of the cross-composition framework, and 
was given by Bodlaender, Jansen and Kratsch [53]. For Exercise point [i] originates 
in [35], points |3} |9| [17] originate in [141], points [7} [8] originate in [127], points [11] 
[18] originate in [114], Pn fol E a in [56], point [14] originates in [276], point 
originates in [53], and point |19]originates in [117]. 

The idea of weak compositions originates in the work of Dell and van Melkebeek [130], 
who proved statements (a) and (b) of Theorem and derived Corollary from 
it. The weak composition framework, as presented in this chapter, was introduced later, 
independently by Dell and Marx and by Hermelin and Wu [260]. In particular, Dell 
and Marx proved statement (c) of Theorem and re-proved statement (b) in a 
simplified manner. The presented lower bound for kernelization of VERTEX COVER is also 
taken from [129]. 
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Appendix: Notation 


Graph notation 


We try to use as much as possible graph notation compatible with the text- 
book of Diestel [138]. Here we recall the most basic definitions. 

An undirected graph is a pair G = (V, E), where V is some set and Æ isa 
family of 2-element subsets of V. The elements of V are the vertices of G, and 
the elements of E are the edges of G. The vertex set of a graph G is denoted 
by V(G) and the edge set of G is denoted by E(G). In this book all the 
graphs are finite, i.e., the sets V(G) and E(G) are finite. We use shorthands 
n =|V(G)| and m = |E(G)| whenever their use is not ambiguous. An edge of 
an undirected graph with endpoints u and v is denoted by uv; the endpoints 
u and v are said to be adjacent, and one is said to be a neighbor of the other. 
We also say that vertices u and v are incident to edge uv. 

Unless specified otherwise, all the graphs are simple, which means that 
no two elements of E(G) are equal, and that all edges from E(G) have two 
different endpoints. If we allow multiple edges with the same endpoints (al- 
lowing E(G) to be a multiset), or edges having both endpoints at the same 
vertex (allowing elements of E(G) to be multisets of size 2 over E(G), called 
loops), then we arrive at the notion of a multigraph. Whenever we use multi- 
graphs, we state it explicitly, making clear whether multiple edges or loops 
are allowed. 

Similarly, a directed graph is a pair Œ = (V, E), where V is the set of 
vertices of G, and E is the set of edges of G, which this time are ordered 
pairs of two different vertices from V. Edges in directed graphs are often also 
called arcs. For an edge (u,v) € E(G), we say that it is directed from u to 
v, or simply that it goes from u to v. Then v is an outneighbor of u and u is 
an inneighbor of v. For an arc a = (u,v), u is the tail and v is the head of a. 
Again, we can allow multiple edges with the same endpoints, as well as edges 
with the same head as tail (loops). Thus we arrive at the notion of a directed 
multigraph. 
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For an undirected graph G and edge uv € E(G), by contracting edge uv we 
mean the following operation. We remove u and v from the graph, introduce 
a new vertex Ww, and connect it to all the vertices u or v were adjacent to. 
Note that the operation, as we defined it here, transforms a simple graph 
into a simple graph; in other words, if u and v had a common neighbor zx, 
then the new vertex Wu» will be connected to x only via one edge. We may 
also view the operation of contraction as an operation on multigraphs: every 
edge ux and vax for some x € V(G) gives rise to one new edge wy,2, and 
in particular every copy of the edge uv apart from the contracted one gives 
rise to a new loop at Ww. For us the default type of contraction is the one 
defined on simple graphs, and whenever we use the multigraph version, we 
state it explicitly. 

Graph H is a subgraph of graph G if V(H) C V(G) and E(H) C E(G). 
Similarly, graph H is a supergraph of graph G if V(H) > V(G) and E(H) 2 
E(G). For a subset S C V(G), the subgraph of G induced by S is denoted by 
G[S]; its vertex set is S and its edge set consists of all the edges of E(G) that 
have both endpoints in S. For S C V(G), we use G — S to denote the graph 
G[V \ S], and for F C E(G) by G— F we denote the graph (V(G), E(G)\ F). 
We also write G — v instead of G — {v}. The neighborhood of a vertex v in 
G is Ne(v) = {ue V : w € E(G)} and the closed neighborhood of v is 
Ne |v] = Ne(v) U {v}. For a vertex set S C V, we define Ng[S] = Uses N [u] 
and Ng(S) = Ne|[S] \ S. We denote by dg(v) the degree of a vertex v in 
graph G, which is just the number of edges incident to v. We may omit 
indices if the graph under consideration is clear from the context. A graph 
G is called r-regular if all vertices of G have degree r. For directed graphs, 
the notions of inneighborhood, outneighborhood, indegree, and outdegree are 
defined analogously. 

In an undirected graph G, a walk of length k is a nonempty sequence of 
vertices Ug,..., Ug such that for every i = 0,...,k—1 we have vjuj41 E€ E(G). 
If vo = vg, then the walk is closed. A path is a walk where no two vertices 
appear twice, and a cycle is a closed walk where no two vertices appear twice, 
apart from the vertex that appears at the beginning and at the end. We often 
think of paths and cycles as graphs induced by vertices and edges traversed 
by them. Then a path is a graph P of the form 


V(P) = {v0,01,... 0k} E(P) = {vov1, vive, ..., UR—1UK}- 


If P = vov ... vk is a path, then the graph obtained from P by adding edge 
LEX is a cycle. Again, the length of a path or cycle is equal to the cardinality 
of its edge set. We denote by dist(u,v) the distance between u and v in a 
graph G, which is the shortest length of a path between u and v. The notions 
of walks, paths, and cycles can be naturally generalized to directed graphs 
by requesting compliance with the edge directions. 

The girth of a graph G is the shortest length of a cycle in G. A Hamiltonian 
path (cycle) in a graph G is a path (cycle) passing through all the vertices of 
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G. A G graph is bipartite if V(G) can be partitioned into two subsets (A, B) 
such that every edge of E(G) has one endpoint in A and the second in B. 
Partition (A, B) is also called a bipartition of G. Equivalently, G is bipartite 
if and only if G does not have any cycle of odd length. 

An undirected graph G is connected if for every pair u,v of its vertices 
there is a path between u and v. A vertex set X C V(G) is connected if the 
subgraph G[X] is connected. 

A tree T is a connected undirected graph without cycles. A tree T can 
be rooted at a vertex r € V(T), which imposes on V(T) natural parent-child 
and ancestor-descendant relations. A forest F is an undirected graph without 
cycles; thus all the connected components of F are trees. A spanning tree T 
of a graph G is a tree such that V(T) = V(G) and E(T) C E(G). A directed 
graph G is acyclic, or a DAG, if it does not contain any directed cycles. 

A matching M in a graph G is a set of edges that pairwise do not share end- 
points. A vertex of G is saturated if it is incident to an edge in the matching. 
Otherwise the vertex is unsaturated. For a given matching M, an alternating 
path is a path in which the edges alternately belong to the matching and 
do not belong to the matching. An augmenting path is an alternating path 
that starts from and ends at an unsaturated vertex. A perfect matching is a 
matching M that saturates all the vertices of the graph, i.e., every vertex of 
the graph is an endpoint of an edge in M. 

An independent set I in a graph G is a subset of the vertex set V(G) 
such that the vertices of J are pairwise nonadjacent. A clique C in a graph 
G is a subset, of the vertex set V(G) such that the vertices of C are pairwise 
adjacent. A vertex cover X of a graph G is a subset of the vertex set V(G) 
such that X covers the edge set E(G), i.e., every edge of G has at least one 
endpoint in X. A dominating set D of a graph G is a subset of the vertex 
set V(G) such that every vertex of V(G) \ D has a neighbor in D. A proper 
coloring of a graph G assigns a color to each vertex of G in such a manner 
that adjacent vertices receive distinct colors. The chromatic number of G is 
the minimum y such that there is a proper coloring of G using x colors. 

We now define the notion of a planar graph and an embedding of a graph 
into the plane. First, instead of embedding into the plane we will equivalently 
embed our graphs into a sphere: in this manner, we do not distinguish unnec- 
essarily the outer face of the embedding. Formally, an embedding of a graph 
G into a sphere is a mapping that maps (a) injectively each vertex of G into a 
point of the sphere, and (b) each edge uv of G into a Jordan curve connecting 
the images of u and v, such that the curves are pairwise disjoint (except for 
the endpoints) and do not pass through any other image of a vertex. A face 
is a connected component of the complement of the image of G in the sphere; 
if G is connected, each face is homeomorphic to an open disc. A planar graph 
is a graph that admits an embedding into a sphere, and a plane graph is a 
planar graph together with one fixed embedding. 

The classic theorems of Kuratowski and Wagner state that a graph is pla- 
nar if and only if it does not contain K5 or K3, as a minor (equivalently, as 
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a topological minor). In Chapters [6]and [7] we briefly mention graphs embed- 
dable into more complicated surfaces than plane or sphere. We refer to the 
bibliographic notes at the end of Chapter [6] for pointers to the literature on 
this topic. 


SAT notation 


Let Vars = {%1,%2,...,%n} be a set of Boolean variables. A variable x or 
a negated variable ~x is called a literal. A propositional formula y is in 
conjunctive normal form, or is a CNF formula, if it is of the form: 


p=CyACgA...ACm.- 
Here, each C; is a clause of the form 
CSU VLN enV E 


where t are literals of some variables of Vars. The number of literals r; in a 
clause C; is called the length of the clause, and is denoted by |C;|. The size of 
formula ¢ is defined as |p| = )7i", |C;|. The set of clauses of a CNF formula 
is usually denoted by Cls. 

For q > 2, a CNF formula ¢ is in q-CNF if every clause from ọ has at 
most q literals. If p is a formula and X a set of variables, then we denote 
by y — X the formula obtained from y after removing all the clauses that 
contain a literal of a variable from X. 

For a CNF formula y on variables Vars, a truth assignment is a mapping 
w: Vars + {1L,T}. Here, we denote the false value as L, and the truth 
value as T. This assignment can be naturally extended to literals by taking 
(~nr) = 7w(x) for each x € Vars. A truth assignment Y satisfies a clause 
C of ọ if and only if C contains some literal £ with w(¢) = T; y satisfies 
formula y if it satisfies all the clauses of y. A formula is satisfiable if it is 
satisfied by some truth assignment; otherwise it is unsatisfiable. 

The notion of a truth assignment can be naturally generalized to partial 
assignments that valuate only some subset X C Vars; i.e., w is a mapping 
from X to {L, T}. Here, a clause C is satisfied by 4% if and only if C contains 
some literal @ whose variable belongs to X, and which moreover satisfies 


ve) =T. 


Problem definitions 


(p, q)- CLUSTER 


Input: 

A graph G, a vertex v € V(G) and integers 
p and q. 

Question: 

Does there exist a (p,q)-cluster containing 
v, that is, a set C C V(G) such that v € C, 
|C| <p, and d(C) < q? 


(p, q)- PARTITION 


Input: 

A graph G and integers p and q. 
Question: 

Does there exist a partition of V(G) into 
(p, q)-clusters? Here, a set C C V(G) isa 
(p, q)-cluster if |C| < p and d(C) < q. 


2-MatTROID INTERSECTION 

Input: 

A universe U, two matrices representing 
matroids M,,Mg2 over U, and an integer 
K: 

Question: 

Does there exist a set S C U of size at least 
k that is independent both in Mı and M2? 


2-SAT 

Input: 

A CNF formula y, where every clause 
consists of at most two literals. 

Question: 

Does there exist a satisfying assignment for 
p? 


2-DEGENERATE VERTEX DELETION 
Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k vertices 
of G such that G — X is 2-degenerate? 


2k x 2k BIPARTITE PERMUTATION INDEPEN- 
DENT SET 

Input: 

An integer k and a graph G with the vertex 
set [2k] x [2k], where every edge is between 
Ty = [k] x [k] and Iz = ([2k]\ [R]) x ([2k]\ [k])- 
Question: 

Does there exist an independent set 
X C h U in G that induces a permuta- 
tion of [2k]? 


3-COLORING 

Input: 

A graph G. 

Question: 

Does there exist a coloring c : V(G) > 
{1,2,3} such that c(u) Æ c(v) for every 
uv € E(G)? 


3-HITTING SET 

Input: 

A universe U, a family A of sets over U, 
where each set in A is of size at most 3, 
and an integer k. 

Question: 

Does there exist a set X C U of size at 
most k that has a nonempty intersection 
with every element of A? 
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3-MATROID INTERSECTION 

Input: 

A universe U, three matrices representing 
matroids Mj ,,M2,Mg3 over U, and an 
integer k. 

Question: 

Does there exist a set S C U of size at least 
k that is independent in every matroid M,, 
i= 1,2,3? 


3-SAT 

Input: 

A CNF formula y, where every clause 
consists of at most three literals. 
Question: 

Does there exist a satisfying assignment for 
p? 

G VERTEX DELETION 


Input: 

A graph G, an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G — X € G? Here, 
G denotes any graph class that is a part of 
the problem definition. 


-M ATROID INTERSECTION 

Input: 

A universe U, matrices representing 
matroids M1, M2,..., Mge over U, and an 
integer k. 

Question: 

Does there exist a set S C U of size at least 
k that is independent in every matroid M;i, 
1<i< l? 


ġ-MAXIMIZATION 


Input: 

A graph G and an integer k. 

Question: 

Does there exist a set S of at least k vertices 
of G such that (G, S) is true? Here, ¢ 
denotes any computable Boolean predicate 
that is a part of the problem definition. 


-MINIMIZATION 


Input: 

A graph G and an integer k. 

Question: 

Does there exist a set S of at most k vertices 
of G such that (G, S) is true? Here, ¢ 
denotes any computable Boolean predicate 
that is a part of the problem definition. 


Problem definitions 


d-BOUNDED-DEGREE DELETION 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that the maximum 
degree of G — X is at most d? 


d-CLUSTERING 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set A C (Ve) of 
size at most k such that the graph 
(V(G), (E(G) \ A) U (A \ E(G))) is a 
d-cluster graph? Here, a d-cluster graph is 
a graph that contains exactly d connected 
components, each being a clique. 


d-HiTTING SET 

Input: 

A universe U, a family A of sets over U, 
where each set in A is of size at most d, 
and an integer k. 

Question: 

Does there exist a set X C U of size at 
most k that has a nonempty intersection 
with every element of A? 


d-SET PACKING 

Input: 

A universe U, a family A of sets over U, 
where each set in A is of size at most d, 
and an integer k. 

Question: 

Does there exist a family A’ C A of k 
pairwise disjoint sets? 


k-'TREE 


A synonym for TREE SUBGRAPH IsoMoR- 
PHISM. 


kx k CLIQUE 


Input: 

An integer k and a graph G with a vertex 
set [k] x [k]. 

Question: 

Does there exist a set X C V(G) that is a 
clique in G and that contains exactly one 
element in each row {i} x [k], 1 < i < k? 


Problem definitions 


kx k HITTING SET 


Input: 

An integer k and a family A of subsets of 
[k] x [k]. 

Question: 

Does there exist a set X C [k] x [k] that has 
a nonempty intersection with every element 
of A and that contains exactly one element 
in each row {i} x [k], 1< i< k? 


k x k HITTING SET WITH THIN SETS 


Input: 

An integer k and a family A of subsets of 
[k] x [k], where every element of A contains 
at most one element in each row {i} x [k], 
1<i<k. 

Question: 

Does there exist a set X C [k] x [k] that has 
a nonempty intersection with every element 
of A and that contains exactly one element 
in each row {i} x [k], 1< i< k? 


k x k PERMUTATION CLIQUE 


Input: 

An integer k and a graph G with a vertex 
set [k] x [k]. 

Question: 

Does there exist a set X C V(G) that is a 
clique in G and that induces a permutation 
of [k]? 


k x k PERMUTATION HITTING SET 


Input: 

An integer k and a family A of subsets of 
[k] x [k]. 

Question: 

Does there exist a set X C [k] x [k] that has 
a nonempty intersection with every element 
of A and that induces a permutation of [k]? 


k x k PERMUTATION HITTING SET WITH 


THIN SETS 


Input: 

An integer k and a family A of subsets of 
[k] x [k], where every element of A contains 
at most one element in each row {i} x [k], 
L<i<k. 

Question: 

Does there exist a set X C [k] x [k] that has 
a nonempty intersection with every element 
of A and that induces a permutation of [k]? 
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k x k PERMUTATION INDEPENDENT SET 
Input: 

An integer k and a graph G with a vertex 
set [k] x [k]. 

Question: 

Does there exist a set X C V(G) that is an 
independent set in G and that induces a 
permutation of [k]? 


q- COLORING 


Input: 

A graph G. 

Question: 

Does there exist a coloring c : V(G) — [q] 
such that c(w) 4 c(v) for every w € E(G)? 


g-SAT 


Input: 

A CNF formula p, where every clause 
consists of at most q literals. 

Question: 

Does there exist a satisfying assignment for 
p? 


q-SET COVER 


Input: 

A universe U, a family F over U, where 
every element of F is of size at most q, and 
an integer k. 

Question: 

Does there exist a subfamily F’ C F of size 
at most k such that LJ F’ = U? 


r-CENTER 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that every vertex of G is 
within distance at most r from at least one 
vertex of X? 


s-Way CUT 

Input: 

A graph G and integers s and k. 
Question: 

Does there exist a set X of at most k 
edges of G such that G — X has at least s 
connected components? 
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ALMOST 2-SAT 

Input: 

A CNF formula y, where every clause 
consists of at most two literals, and an 
integer k. 

Question: 

Is it possible to make y satisfiable by 
deleting at most k clauses? 


ANNOTATED BIPARTITE COLORING 


Input: 

A bipartite graph G, two sets 
Bı, B2 C V(G), and an integer k. 
Question: 


Does there exist a set X of at most k 
vertices of G such that G — X has a proper 
2-coloring c : V(G)\ X — {1,2} (ie, 
c(u) # c(v) for every edge uv) that agrees 
with the sets Bı and Bo, that is, f(v) = i 
whenever v € B; \ X and i = 1,2? 


ANNOTATED SATISFIABLE ALMOST 2-SAT 
Input: 

A satisfiable formula y in CNF form, where 
every clause of y consists of at most two 
literals, two sets of variables VT and V+, 
and an integer k. 

Question: 

Does there exist a set X of at most k 
variables of y and a satisfying assignment 
w of p — X such that y(x) = T for every 
x € V! \ X and w(x) = L for every 
x € V+ \ X? Here, p — X denotes the 
formula y with every clause containing at 
least one variable of X deleted. 


Bar FIGHT PREVENTION 

Input: 

A bar, a list of n potential guests, for every 
pair of guests a prediction whether they 
will fight if they are admitted together to 
the bar, and an integer k. 

Question: 

Can you identify a set of at most k trou- 
blemakers, so that if you admit to the bar 
everybody except for the troublemakers, 
no fight breaks out among the admitted 
guests? 


Problem definitions 


BIPARTITE MATCHING 

Input: 

A bipartite graph G and an integer k. 
Question: 

Does there exist an edge set S C E(G) of 
size at least k such that no two edges in S 
share an endpoint? 


CNF-SAT 

Input: 

A formula » in conjunctive normal form 
(CNF). 

Question: 

Does there exist a satisfying assignment for 
p? 


CHORDAL COMPLETION 

Input: 

A graph G and an integer k. 

Question: 

Can one add at most k edges to G to turn 
it into a chordal graph? 


CHROMATIC NUMBER 

Input: 

A graph G. 

Question: 

What is the minimum integer q, such that 
there exists a coloring c : V(G) —> [q] 
satisfying c(u) # c(v) for every edge 
uv € E(G)? 


CLIQUE 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set of k vertices of G 
that is a clique in G? 


CLIQUE}og 


Input: 

A graph G and an integer k such that 
k < logy |V(G)|. 

Question: 

Does there exist a set of k vertices of G 
that is a clique in G? 


Problem definitions 


CLOSEST STRING 

Input: 

A set of k strings 71,...,2,%, each string 
over an alphabet X and of length L, and an 
integer d. 

Question: 

Does there exist string y of length L over X 
such that dy (y, xi) < d for every 1 < i < k. 
Here, dy(x,y) is the Hamming distance 
between strings x and y, that is, the number 
of positions where x and y differ. 


CLOSEST SUBSTRING 

Input: 

A set of k strings 21, ..., &k Over an 
alphabet X, and integers L and d 
Question: 

Does there exist a string s of length L such 
that, for every 1 < i < k, x; has a substring 
x’ of length L with dy(s,s;) < d. Here, 
d(x,y) is the Hamming distance between 
strings x and y, that is, the number of 
positions where x and y differ. 


CLUSTER EDITING 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set A C (VO) of 
size at most k such that the graph 
(V(G), (E(G) \ A) U(A\ E(G))) is a cluster 
graph? Here, a cluster graph is a graph 
where every connected component is a 
clique. 


CLUSTER VERTEX DELETION 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G — X is a cluster 
graph? Here, a cluster graph is a graph 
where every connected component is a 
clique. 


CocHROMATIC NUMBER 

Input: 

A graph G. 

Question: 

What is the minimum integer q, such that 
there exists a coloring c : V(G) — [q] 
satisfying c(u) # c(v) for every pair of 
nonadjacent vertices u,v? 
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COLORED RED-BLUE DOMINATING SET 
Input: 

A bipartite graph G with bipartition classes 
Rw B=V(G), an integer £ and a partition 
of R into £ sets R1, R?,..., RE. 

Question: 

Does there exist a set X C R that con- 
tains exactly one element of every set Rt, 
1<i< £and such that Ne(X) = B? 


COLORFUL GRAPH MOTIF 


Input: 

A graph G, an integer k and a coloring 
c: V(G) > [k]. 

Question: 


Does there exist a connected subgraph of 
G that contains exactly one vertex of each 
color? 


COMPONENT ORDER INTEGRITY 

Input: 

A graph G and two integers k and £. 
Question: 

Does there exist a set X of at most k 
vertices of G such that every connected 
component of G — X contains at most £ 
vertices? 


CONNECTED BULL HITTING 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G[X] is connected 
and G — X does not contain a bull as an 
induced subgraph? Here, a bull is a 5-vertex 
graph H with V(H) = {a,b,c,d,e} and 
E(H) = {ab, bc, ac, bd, ce}. 


CONNECTED DOMINATING SET 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G[X] is connected 
and Ng[|X] = V(G)? 


CONNECTED FEEDBACK VERTEX SET 
Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G[X] is connected 
and G — X is a forest? 
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CONNECTED VERTEX COVER 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G[X] is connected 
and G — X is edgeless? 


CYCLE PACKING 

Input: 

A graph G and an integer k. 

Question: 

Does there exist in G a family of k pairwise 
vertex-disjoint cycles? 


DIGRAPH Pair CUT 

Input: 

A directed graph G, a designated vertex 
s € V(G), a family of pairs of vertices 
Fc (YO), and an integer k. 

Question: 

Does there exist a set X of at most k edges 
of G, such that for each pair {u,v} € F, 
either u or v is not reachable from s in the 
graph G — X. 


DIRECTED EDGE MULTICUT 

Input: 

A directed graph G, a set of pairs (s;, ti)£ 1 
of vertices of G, and an integer k. 
Question: 

Does there exist a set X of at most k edges 
of G such that for every 1 < i < £, vertex ti 
is not reachable from vertex s; in the graph 
G— x? 


DIRECTED EDGE MuLTIway CUT 

Input: 

A directed graph G, a set T C V(G), and 
an integer k. 

Question: 

Does there exist a set X of at most k 
edges of G such that for every two distinct 
vertices t1,t2 € T, vertex t2 is not reachable 
from vertex tı in the graph G — X? 


DIRECTED FEEDBACK ARC SET 

Input: 

A directed graph G and an integer k. 
Question: 

Does there exist a set X of at most k edges 
of G such that G — X is acyclic? 


Problem definitions 


DIRECTED FEEDBACK ARC SET COMPRES- 
SION 

Input: 

A directed graph G, a set W C V(G) such 
that G — W is acyclic, and an integer k. 
Question: 

Does there exist a set X of at most k edges 
of G such that G — X is acyclic? 


DIRECTED FEEDBACK VERTEX SET 
Input: 

A directed graph G and an integer k. 
Question: 

Does there exist a set X of at most k 
vertices of G such that G — X is acyclic? 


DIRECTED Max LEAF 

Input: 

A directed graph G and an integer k. 
Question: 

Does there exist an out-tree in G with 
at least k leaves? Here, an out-tree is a 
directed graph whose underlying undirected 
graph is a tree, and every vertex of the 
out-tree, except for one, has indegree 
exactly one; a leaf of an out-tree is a vertex 
with outdegree zero. 


DIRECTED STEINER TREE 

Input: 

A directed graph G, a designated root 
vertex r, a set T C V(G), and an integer £. 
Question: 

Does there exist a subgraph H of G with at 
most £ edges, such that every vertex t € T 
is reachable from r in H? 


DIRECTED VERTEX MULTIWAY CUT 
Input: 

A directed graph G, a set T C V(G), and 
an integer k. 

Question: 

Does there exist a set X C V(G) \ T of size 
at most k such that for every two distinct 
vertices t1,t2 € T, vertex t2 is not reachable 
from vertex tı in the graph G — X? 


Problem definitions 


DISJOINT FACTORS 


Input: 

A word w over an alphabet r = 
{11 V2 Ys} 

Question: 


Does there exist pairwise disjoint subwords 
U1,U2,---,Us Of w such that each u; is of 
length at least two and begins and ends 
with yi? 


DisJOINT FEEDBACK VERTEX SET 

Input: 

A graph G, a set W C V(G) such that 
G — W is a forest, and an integer k. 
Question: 

Does there exist a set X C V(G) \ W of 
size at most k such that G — X is a forest? 


DISJOINT FEEDBACK VERTEX SET IN 
TOURNAMENTS 

Input: 

A tournament G, a set W C V(G) such 
that G — W is acyclic, and an integer k. 
Question: 

Does there exist a set X C V(G) \ W of 
size at most k such that G — X is acyclic? 


DISJOINT ODD CYCLE TRANSVERSAL 
Input: 

A graph G, a set W C V(G) such that 
G — W is bipartite, and an integer k. 
Question: 

Does there exist a set X C V(G)\W of size 
at most k such that G — X is bipartite? 


DisJOINT PLANAR VERTEX DELETION 
Input: 

A graph G, a set W C V(G) such that 
G — W is planar, and an integer k. 
Question: 

Does there exist a set X C V(G) \ W of 
size at most k such that G — X is planar? 


DISJOINT VERTEX COVER 

Input: 

A graph G, a set W C V(G) such that 
G — W is edgeless, and an integer k. 
Question: 

Does there exist a set X C V(G) \ W of 
size at most k such that G — X is edgeless? 
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DISTORTION 

Input: 

A graph G and an integer d. 

Question: 

Does there exist an embedding n : V (G) > 
Z such that for every u,v € V(G) we have 
diste (u,v) < |n(u) —n(v)| < d- dista(u, v)? 


DOMINATING SET 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that Ng[|X] = V(G)? 


DOMINATING SET ON TOURNAMENTS 
Input: 

A tournament G and an integer k. 
Question: 

Does there exist a set X of at most k ver- 
tices of G such that for every v € V(G)\ X 
there exists u € X with (u,v) € E(G)? 


DOMINATING SET WITH PATTERN 

Input: 

A graph G, an integer k, and a graph H on 
vertex set [k] 

Question: 


Does there exist a tuple (v1,v2,...,UK) 
of distinct vertices of G such that 
Ne Hvi, v2, see URS] = V(G) and 
vivj E€ E(G) if and only if ij € E(H)? 
DUAL-COLORING 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a coloring c : V(G) > 


[n — k] such that c(w) 4 c(v) for every edge 
uv? 


Ed-HITTING SET 

Input: 

A universe U, a family A of sets over U, 
where each set in A is of size exactly d, and 
an integer k. 

Question: 

Does there exist a set X C U of size at 
most k that has a nonempty intersection 
with every element of A? 
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Ed-Set PACKING 

Input: 

A universe U, a family A of sets over U, 
where each set in A is of size exactly d, and 
an integer k. 

Question: 

Does there exist a family A’ C A of k 
pairwise disjoint sets? 


EDGE BIPARTIZATION 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k edges 
of G such that G — X is bipartite? 


EDGE CLIQUE COVER 


Input: 

A graph G and an integer k. 

Question: 

Does there exist k subgraphs 
Ay, H2,..., Hk of G, such that each 


H; is a clique and E(G) = U*_, E(Hi)? 


EDGE DISJOINT CYCLE PACKING 

Input: 

A graph G and an integer k. 

Question: 

Does there exist in G a family of k pairwise 
edge-disjoint cycles? 


EDGE DOMINATING SET 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k edges 
of G such that G — V(X) is edgeless? 


EDGE MULTICUT 

Input: 

A graph G, a set of pairs (s;,ti)f_, of 
vertices of G, and an integer k. 

Question: 

Does there exist a set X of at most k 
edges of G such that for every 1 < i < £, 
vertices s; and t; lie in different connected 
components of G — X? 


Problem definitions 


EpGE Muttiway CUT 

Input: 

A graph G, a set T C V(G), and an integer 
k. 

Question: 

Does there exist a set X of at most k edges 
of G such that every element of T lies in a 
different connected component of G — X? 


Epce Muttiway CUT FOR SETS 

Input: 

A graph G, pairwise disjoint sets Ti, ..., 
Tp of vertices of G, and an integer k. 
Question: 

Does there exist a set X of at most k edges 
of G such that for every 1 < i < j < p there 
is no path between any vertex of T; and 
any vertex of Tj in G — X? 


EULERIAN DELETION 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k edges 
of G such that G — X contains an Euler 
tour? 


EvEN SET 

Input: 

A universe U, a family F of subsets of U, 
and an integer k. 

Question: 

Does there exist a nonempty set X C U of 
size at most k such that |AN X| is even for 
every A E€ F? 


Exact CNF-SAT 

Input: 

A formula y in CNF and an integer k. 
Question: 

Does there exist an assignment ~ that 
satisfies y and sets to true exactly k 
variables? 


Exact Even SET 

Input: 

A universe U, a set F of subsets of U, and 
an integer k. 

Question: 

Does there exist a nonempty set X C U of 
size exactly k such that |AN X| is even for 
every AG F? 


Problem definitions 


Exact Opp SET 

Input: 

A universe U, a set F of subsets of U, and 
an integer k. 

Question: 

Does there exist a nonempty set X C U of 
size exactly k such that |AN X| is odd for 
every AG F? 


Exact UNIQUE HITTING SET 

Input: 

A universe U, a set A of subsets of U, and 
an integer k. 

Question: 

Does there exist a set X C U of size exactly 
k such that |AN X| = 1 for every A € A? 


FAST 


An abbreviation for FEEDBACK ARC SET 
IN TOURNAMENTS. 


FVST 


An abbreviation for FEEDBACK WERTEX 
SET IN TOURNAMENTS. 


FacE COVER 

Input: 

A graph G embedded on a plane and an 
integer k. 

Question: 

Does there exist a set X of at most k faces 
of the embedding of G such that every 
vertex of G lies on at least one face of X? 


FEEDBACK ARC SET IN TOURNAMENTS 
Input: 

A tournament G and an integer k. 
Question: 

Does there exist a set X of at most k edges 
of G such that G — X is acyclic? 


FEEDBACK VERTEX SET 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G — X is a forest? 


FEEDBACK VERTEX SET IN TOURNAMENTS 


Input: 

A tournament G and an integer k. 
Question: 

Does there exist a set X of at most k 
vertices of G such that G — X is acyclic? 
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FEEDBACK VERTEX SET IN TOURNAMENTS 
COMPRESSION 

Input: 

A tournament G, a set W C V(G) such 
that G — W is acyclic, and an integer k. 
Question: 

Does there exist a set X of at most k 
vertices of G such that G — X is acyclic? 


GRAPH GENUS 

Input: 

A graph G and an integer g. 

Question: 

Can G be embedded on a surface of Euler 
genus g? 


GRAPH ISOMORPHISM 
Input: 

Two graphs G and H. 
Question: 

Are G and H isomorphic? 


GRAPH MOTIF 

Input: 

A graph G, an integer k, a coloring 
c : V(G) > [k], and a multiset M with 
elements from [k]. 

Question: 

Does there exist a set X C V(G) such 
that G[X] is connected and the multiset 
{c(u) : u € X} equals M? 


GRID TILING 

Input: 

An integer k, an integer n, and a collection 
S of k? nonempty sets Sij C [n] x [n] 
G <ij <k). 

Question: 

Can one choose for each 1 < i,j < k a pair 
Si j E Sij such that 


e Ifs; j= (a,b) and s;41,; = (a’,b’), then 
a=d. 


e Ifs; j= (a,b) and si j+1 = (a’,b’), then 
b=b. 
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Grip TILING WITH < 


Input: 

An integer k, an integer n, and a collection 
S of k? nonempty sets Sij C [n] x [n] 
(1<i,j <k). 

Question: 

Can one choose for each 1 < i,j < k a pair 
Sij E Sij such that 


e Ifs; j= (a,b) and s;41,; = (a',b'), then 
a<a’. 

e Ifs; j= (a,b) and si j+1 = (a’,b’), then 
b< V. 


HALL SET 

Input: 

A bipartite graph G with bipartition classes 
AW B = V(G) and an integer k. 
Question: 

Does there exist a set X C A of size at 
most k such that |Ne(S)| < |S|? 


HALTING 

Input: 

A description of a deterministic Turing 
machine M, an input word zx. 

Question: 

Does M halt on «? 


HAMILTONIAN CYCLE 

Input: 

A graph G. 

Question: 

Does there exist a simple cycle C in G such 
that V (C) = V (G)? 


HAMILTONIAN PATH 

Input: 

A graph G. 

Question: 

Does there exist a simple path P in G such 
that V (P) = V(G)? 


HITTING SET 

Input: 

A universe U, a family A of sets over U, 
and an integer k. 

Question: 

Does there exist a set X C U of size at 
most k that has a nonempty intersection 
with every element of A? 


Problem definitions 


IMBALANCE 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a bijective function 


a: V(G) > {1,2,...,|V(G)|} (called an 
ordering) whose imbalance is at most k? 
Here, the imbalance of a vertex v € V(G) 
in an ordering 7 is defined as 


tx(v) = ||{u E Ne(v) : mu) < m(v)}|- 
l{u € Ne(v) : m(u) > r(v)} II, 


and the imbalance of an ordering m is de- 


fined as 
um) = $ rt) 


veVv(G) 


INDEPENDENT DOMINATING SET 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G[X] is edgeless and 
Ne [|X] = V(G)? 


INDEPENDENT FEEDBACK VERTEX SET 
Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G[X] is edgeless and 
G — X is a forest? 


INDEPENDENT SET 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G[X] is edgeless? 


INDUCED MATCHING 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of exactly 2k 
vertices of G such that G[X] is a matching 
consisting of k edges? 


Problem definitions 


INTEGER LINEAR PROGRAMMING 

Input: 

Integers m and p, a matrix A € Z™ XP, and 
vectors b € Z™, c € DP. 

Question: 

Find a vector x € ZP that satisfies Ax < b 
and that minimizes the value of the scalar 
product c- x. 


INTEGER LINEAR PROGRAMMING FEASIBIL- 
ITY 


Input: 

Integers m and p, a matrix A € Z™ XP, and 
a vector b € Z™. 

Question: 

Does there exist a vector x € ZP such that 
Ax <b? 


LINEAR PROGRAMMING 

Input: 

Integers m and p, a matrix A € R™*P, and 
vectors b € R”, c e R?. 

Question: 

Find a vector x € R? that satisfies Ax < b 
and that minimizes the value of the scalar 
product c- x. 


LINKLESS EMBEDDING 


Input: 

A graph G and an integer k. 

Question: 

Does there exist an embedding of G into 
R3 such that any pairwise linked family 
of cycles in G has size at most k? Here, 
we say that two vertex-disjoint cycles of G 
are linked in an embedding into R3 if they 
cannot be separated by a continuous defor- 
mation (i.e., they look like two consecutive 
links of a chain). 


List COLORING 

Input: 

A graph G and a set L(v) for every 
v E V(G) 

Question: 

Can one choose a color c(v) € L(v) for 
every v € V(G) such that c(u) 4 c(v) for 
every uv € E(G)? 


Lone DIRECTED CYCLE 

Input: 

A directed graph G and an integer k. 
Question: 

Does there exist a directed cycle in G of 
length at least k? 
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Lone INDUCED PATH 

Input: 

A graph G and an integer k. 

Question: 

Does there exist an induced subgraph of G 
that is isomorphic to a path on k vertices? 


LONGEST COMMON SUBSEQUENCE 


Input: 

Two strings a and b. 

Question: 

What is the length of the longest common 
subsequence of a and b? That is, we ask for 
the largest possible integer n for which there 
exist indices 1 < ių < ig < ... < in < lal 
and 1 < jı < jo <... < jn < |b| such that 
aļlir] = b[j,] for every 1 < r <n. 


LONGEST CYCLE 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a cycle in G of length at 
least k? 


LONGEST PATH 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a path in G consisting of k 
vertices? 


MATROID Ed-SET PACKING 

Input: 

An integer k and a matrix M representing 
a matroid M of rank d- k over a universe 
U, and a family A of sets over U, where 
each set in A is of size exactly d. 
Question: 

Does there exist a family A’ C A of k 
pairwise disjoint sets, such that U,e 4A 
is independent in M? 


MATROID PARITY 

Input: 

A matrix A representing a matroid M over 
a universe U of size 2n, a partition of U 
into n pairs P|, P2,..., Pn, and an integer 
k. 

Question: 

Does there exist an independent set X in 
M of size 2k that is a union of k pairs P;? 
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Max LEAF SPANNING TREE 


Input: 

A graph G and an integer k. 

Question: 

Does there exist a spanning tree of G with 
at least k leaves? 


Max LEAF SUBTREE 

Input: 

A graph G and an integer k. 

Question: 

Does G contain a tree with at least k leaves 
as a subgraph? 


Max-r-SAT 

Input: 

A CNF formula y, where every clause 
consists of at most r literals, and an integer 
k. 

Question: 

Does there exist an assignment y that 
satisfies at least k clauses of y? 


Max-Er-SAT 

Input: 

A CNF formula y, where every clause 
consists of exactly r literals with pairwise 
different variables, and an integer k. 
Question: 

Does there exist an assignment y that 
satisfies at least k clauses of y? 


MAx-INTERNAL SPANNING TREE 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a spanning tree of G with 
at least k internal vertices? 


MaxCut 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a partition A W B of V(G) 
such that at least k edges of G have one 
endpoint in A and the second endpoint in 
B? 

MAXIMUM BISECTION 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a partition A W B of V(G) 
such that ||A] — |B|| < 1 and at least k 
edges of G have one endpoint in A and the 
second endpoint in B? 


Problem definitions 


MAXIMUM CYCLE COVER 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a family C of pairwise 
vertex-disjoint cycles in G such that |C| > k 
and every vertex of G lies on some cycle of 
C? 


MAXIMUM FLOW 

Input: 

A directed graph G with edge capacity 
function c : E(G) > Rso and two desig- 
nated vertices s,t € V (G). 

Question: 

Find a maximum flow from s to t. That is, 
find a function f : E(G) + R>o that satis- 
fies the capacity constraints 0 < f(e) < c(e) 
for every e € E(G), flow conserva- 
tion constraints X wuce) f(v,u) = 
Z (u,ve E(G) f(u,v) for every v E€ 
V(G) \ {s,t}, and that maximizes 
Z (s,ujeE(G) F(s,u) — Z (u,s)€B(G) f(u, s). 


MAXIMUM MATCHING 

Input: 

A graph G and an integer k. 

Question: 

Does there exist an edge set S C E(G) of 
size at least k such that no two edges in S 
share an endpoint? 


MAXIMUM SATISFIABILITY 

Input: 

A CNF formula y and an integer k. 
Question: 

Does there exist an assignment ~ that 
satisfies at least k clauses of y? 


Min-2-SAT 

Input: 

A CNF formula p, where every clause 
consists of at most two literals, and an 
integer k. 

Question: 

Does there exist an assignment % that 
satisfies at most k clauses of y? 


Problem definitions 


Min-OnEs-2-SAT 

Input: 

A CNF formula y, where every clause 
consists of at most two literals, and an 
integer k. 

Question: 

Does there exist an assignment y that 
satisfies y and sets at most k variables to 
true? 


Min-OnEs-r-SAT 

Input: 

A CNF formula y, where every clause 
consists of at most r literals, and an integer 
k. 

Question: 

Does there exist an assignment y that 
satisfies y and sets at most k variables to 
true? 


MINIMUM BISECTION 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a partition A W B of V (G) 
such that || A| — |B|| < 1 and at most k 
edges of G have one endpoint in A and the 
second endpoint in B? 


MINIMUM MAXIMAL MATCHING 

Input: 

A graph G and an integer k. 

Question: 

Does there exist an inclusion-wise maximal 
matching in G that consists of at most k 
edges? 


MULTICOLORED BICLIQUE 


Input: 

A bipartite graph G with bipartition classes 
AWB = V (G), an integer k, a partition of A 
into k sets A1, A2,..., Ak, and a partition 
of B into k sets By, B2,..., Bk. 

Question: 

Does there exist a set X C AUB that 
contains exactly one element of every set 
A; and B;, 1 < i < £ and that induces a 
complete bipartite graph Kk, in G? 
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MULTICOLORED CLIQUE 

Input: 

A graph G, an integer k, and a partition of 
V(G) into k sets Vi, V2,..., Vk. 

Question: 

Does there exist a set X C V(G) that 
contains exactly one element of every set V; 
and that is a clique in G? 


MULTICOLORED INDEPENDENT SET 
Input: 

A graph G, an integer k, and a partition of 
V(G) into k sets Vi, V2,..., Vp. 

Question: 

Does there exist a set X C V(G) that 
contains exactly one element of every set V; 
and that is an independent set in G? 


MUuLTICUT 


A synonym for VERTEX MULTICUT. 


NAE-SAT 

Input: 

A CNF formula ¢. 

Question: 

Does there exist an assignment w such that 
for every clause C in y, at least one literal 
of C is evaluated to true and at least one 
literal is evaluated to false by w? 


Opp CYCLE TRANSVERSAL 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G — X is bipartite? 


ODD SET 

Input: 

A universe U, a set F of subsets of U, and 
an integer k. 

Question: 

Does there exist a nonempty set X C U of 
size at most k such that |AN X| is odd for 
every AG F? 


PARTIAL DOMINATING SET 

Input: 

A graph G and integers k and r. 
Question: 

Does there exist a set X of at most k 
vertices of G such that |NG[X]| > r? 
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PARTIAL VERTEX COVER 

Input: 

A graph G and integers k and r. 
Question: 

Does there exist a set X of at most k 
vertices of G such that at least r edges of G 
are incident to at least one vertex of X? 


PARTITIONED CLIQUE 


A synonym for MULTICOLORED CLIQUE. 


PERFECT d-SET MATCHING 

Input: 

A universe U, a family A of sets over U, 
where each set in A is of size exactly d, and 
an integer k. 

Question: 

Does there exist a family A’ C A of k 
pairwise disjoint sets such that LJ A’ = U? 


PERFECT CODE 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that for every u € V(G) 
there exists exactly one vertex v € X for 
which u € Ne|v]? 


PERMUTATION COMPOSITION 

Input: 

A family P of permutations of a universe 
U, additional permutation m of U, and an 
integer k. 

Question: 

Does there exist a sequence 7, 72,.. 
P such that 7 = 71 0720...07 4%? 
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PLANAR 3-COLORING 

Input: 

A planar graph G. 

Question: 

Does there exist a coloring c : V(G) > 
{1,2,3} such that c(u) 4 c(v) for every 
uv € E(G)? 


Problem definitions 


PLANAR 3-SAT 

Input: 

A CNF formula vy, such that every clause 
of y consists of at most three literals and 
the incidence graph of y is planar. Here, 
an incidence graph of a formula ọ is a 
bipartite graph with vertex sets consisting 
of all variables and clauses of y where a 
variable is adjacent to all clauses it appears 
in. 

Question: 

Does there exist a satisfying assignment for 
p? 


PLANAR VERTEX DELETION 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G — X is planar? 


PLANAR DELETION COMPRESSION 

Input: 

A graph G, a set W C V(G) such that 
G — W is planar, and an integer k. 
Question: 

Does there exist a set X of at most k 
vertices of G such that G — X is planar? 


PLANAR DIAMETER IMPROVEMENT 
Input: 

A planar graph G and an integer d. 
Question: 

Does there exist a supergraph of G that is 
planar and has diameter at most d? 


PLANAR FEEDBACK VERTEX SET 

Input: 

A planar graph G and an integer k. 
Question: 

Does there exist a set X of at most k 
vertices of G such that G — X is a forest? 


PLANAR HAMILTONIAN CYCLE 

Input: 

A planar graph G. 

Question: 

Does there exist a simple cycle C in G such 
that V (C) = V (G)? 


Problem definitions 


PLANAR LONGEST CYCLE 

Input: 

A planar graph G and an integer k. 
Question: 

Does there exist a cycle in G of length at 
least k? 


PLANAR LONGEST PATH 

Input: 

A planar graph G and an integer k. 
Question: 

Does there exist a path in G consisting of k 
vertices? 


PLANAR VERTEX COVER 

Input: 

A planar graph G, an integer k. 
Question: 

Does there exist a set X of at most k 
vertices of G such that G — X is edgeless? 


Point LINE COVER 

Input: 

A set P of points in the plane and an 
integer k. 

Question: 

Does there exist a family L of at most k 
lines on the plane such that every point in 
P lies on some line from L? 


POLYNOMIAL IDENTITY TESTING 

Input: 

Two polynomials f and g over a field F, 
given as arithmetic circuits with addition, 
subtraction and multiplication gates. 
Question: 

Is it true that f(x) = g(x) for every x € F? 


PSEUDO ACHROMATIC NUMBER 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a partition of V (G) into 
k sets Vi,V2,...,V~ such that for every 
1<i< j < k there exists at least one 
edge of G with one endpoint in V; and the 
second endpoint in Vj? 


RAMSEY 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of exactly k 
vertices of G such that GLX] is a clique or 
G[X] is edgeless? 
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ReED-BLUE DOMINATING SET 

Input: 

A bipartite graph G with bipartition classes 
Rw B=V(G) and an integer k. 
Question: 

Does there exist a set X C R of size at 
most k such that Ne(X) = B? 


SATELLITE PROBLEM 

Input: 

A graph G, integers p and q, a vertex 
v € V(G), and a partition Vo, Vi,...,Vr of 
V(G) such that v € Vo and there is no edge 
between V; and Vj for any 1 <i<j<r. 
Question: 

Does there exist a (p, q)-cluster C satisfying 
Vo C C such that for every 1 < i < r, 
either C N Vi = Ø or V; C C? Here, a set 
C C V(G) is a (p,q)-cluster if |C| < p and 
d(C) < q. 


SCATTERED SET 

Input: 

A graph G and integers r and k. 
Question: 

Does there exist a set X of at least k 
vertices of G such that distg(u,v) > r for 
every distinct u,v € X? 


SET COVER 

Input: 

A universe U, a family F over U, and an 
integer k. 

Question: 

Does there exist a subfamily F’ C F of size 
at most k such that LJ F’ = U? 


SET PACKING 

Input: 

A universe U, a family A of sets over U, 
and an integer k. 

Question: 

Does there exist a family A’ C A of k 
pairwise disjoint sets? 


SET SPLITTING 

Input: 

A universe U and a family F of sets over U. 
Question: 

Does there exist a set X C U such that 
ANX #0 and A\ X #90 for every A E€ F? 
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SHORT TURING MACHINE ACCEPTANCE 
Input: 

A description of a nondeterministic Turing 
machine M, a string x, and an integer k. 
Question: 

Does there exist a computation path of M 
that accepts x on at most k steps? 


Skew EDGE MULTICUT 

Input: 

A directed graph G, a set of pairs (s;, t;)£ 1 
of vertices of G, and an integer k. 
Question: 

Does there exist a set X of at most k edges 
of G such that for every 1 <i <j <4, 
vertex tj is not reachable from vertex s; in 
the graph G — X? 


SPECIAL DisJoINT FVS 

Input: 

A graph G, a set W C V(G) such that 
G — W is edgeless and every vertex of 
V(G) \ W is of degree at most three in G, 
and an integer k. 

Question: 

Does there exist a set X C V(G) \ W of 
size at most k such that G — X is a forest? 


SPLIT EDGE DELETION 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k edges 
of G such that G—X is a split graph? Here, 
a split graph is a graph whose vertex set can 
be partitioned into two parts, one inducing 
a clique and one inducing an independent 
set. 


SPLIT VERTEX DELETION 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G — X is a split 
graph? Here, a split graph is a graph whose 
vertex set can be partitioned into two 
parts, one being a clique and one being an 
independent set. 


Problem definitions 


STEINER TREE 

Input: 

A graph G, a set K C V(G), and an integer 
k. 

Question: 

Does there exist a connected subgraph of G 
that contains at most k edges and contains 
all vertices of K? 


STRONGLY (CONNECTED STEINER SUB- 


GRAPH 
Input: 

A directed graph G, a set K C V(G), and 
an integer k. 

Question: 

Does there exist a strongly connected 
subgraph of G that contains at most k 
edges and contains all vertices of K? 


SUBGRAPH ISOMORPHISM 

Input: 

Two graphs G and H. 

Question: 

Does there exist a subgraph of G that is 
isomorphic to H? 


SUBSET SUM 

Input: 

A set S of integers and integers k and m. 
Question: 

Does there exist a subset X C S of size at 
most k whose elements sum up to m? 


TSP 

Input: 

A graph G 
w: E(G) =$ Ryo. 
Question: 

Find a closed walk of minimum possible 
total weight that visits all vertices of G. 


with edge weights 


TOTAL DOMINATING SET 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that for every u € V(G) 
there exists v € X with w € E(G)? 


Problem definitions 


TREE SPANNER 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a spanning tree T of G 
such that for every u,v € V(G) it holds 
that dist (u,v) < kdistg(u, v)? 


TREE SUBGRAPH ISOMORPHISM 

Input: 

A graph G and a tree H. 

Question: 

Does there exist a subgraph of G that is 
isomorphic to H? 


TREEWIDTH 

Input: 

A graph G and an integer k. 
Question: 

Is the treewidth of G at most k? 


TREEWIDTH-7 MODULATOR 


Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G — X has treewidth 
at most n? 


TRIANGLE PACKING 

Input: 

A graph G and an integer k. 

Question: 

Does there exist in G a family of k pairwise 
vertex-disjoint triangles? 


UNIQUE HITTING SET 


Input: 

A universe U, a set A of subsets of U, and 
an integer k. 

Question: 

Does there exist a set X C U of size at most 
k such that |AN X| = 1 for every A € A? 


Unit Disk INDEPENDENT SET 

Input: 

A set P of unit discs in the plane and an 
integer k. 

Question: 

Does there exist a set of k pairwise disjoint 
elements of P? 
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UNIT SQUARE INDEPENDENT SET 


Input: 

A set P of axis-parallel unit squares in the 
plane and an integer k. 

Question: 

Does there exist a set of k pairwise disjoint 
elements of P? 


VARIABLE DELETION ALMOST 2-SAT 
Input: 

A CNF formula y, where every clause of 
y consists of at most two literals, and an 
integer k. 

Question: 

Does there exist a set X of at most k 
variables of y such that y— X is satisfiable? 
Here, p — X denotes the formula y with 
every clause containing at least one variable 
from X deleted. 


VERTEX k-Way CUT 

Input: 

A graph G and integer k and s. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G — X has at least 
s connected components? 


VERTEX COLORING 


A synonym for CHROMATIC NUMBER. 


VERTEX COVER 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G — X is edgeless? 


VERTEX COVER ABOVE MATCHING 
Input: 

A graph G, a matching M in G, and an 
integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G such that G — X is edgeless? 
Note that the question is the same as in 
VERTEX Cover, but the VERTEX COVER 
ABovE MatcuHina problem is usually 
used in the context of above guarantee 
parameterization with the size of the given 
matching M as a lower bound. 
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VERTEX COVER ABOVE LP 

Input: 

A graph G and an integer k. 

Question: 

Does there exist a set X of at most k ver- 
tices of G such that G — X is edgeless? Note 
that this is the same problem as VERTEX 
Cover, but the name VERTEX COVER 
ABOVE LP is usually used in the context of 
above guarantee parameterization with an 
optimum solution to a linear programming 
relaxation as a lower bound. 


VERTEX DISJOINT PATHS 

Input: 

A graph G and k pairs of vertices (si, t;)#_1. 
Question: 

Do there exist k pairwise vertex-disjoint 
paths Pı, Po,...,P, such that P; starts in 
si and ends in t;? 


VERTEX MULTICUT 

Input: 

A graph G, a set of pairs (s;,ti)f_, of 
vertices of G, and an integer k. 

Question: 

Does there exist a set X of at most k 
vertices of G, not containing any vertex 
si or ti, such that for every 1 < i < 4, 
vertices s; and t; lie in different connected 
components of G — X? 


VERTEX MuLTIWAY CUT 

Input: 

A graph G, a set T C V(G), and an integer 
k. 

Question: 

Does there exist a set X C V(G) \ T of 
size at most k such that every element of T 
lies in a different connected component of 
G-X? 


WEIGHTED CIRCUIT SATISFIABILITY 
Input: 

A Boolean circuit C and an integer k. 
Question: 

Does there exist an assignment to the input 
gates of C that satisfies C and that sets 
exactly k input gates to true? 


Problem definitions 


WEIGHTED INDEPENDENT SET 


Input: 

A graph G with vertex weights 
w:V(G) > Ryo. 

Question: 

Find an independent set in G of the 
maximum possible total weight. 
WEIGHTED LONGEST PATH 

Input: 

A graph GŒ with vertex weights 


w: V(G) > N and an integer k. 
Question: 

Find a simple path in G on k vertices of the 
minimum possible total weight. 


Index 


C-bridge, 
#P-complete problem, [358] 
AND-conjecture, 
O*-notation, 


t, see grid 























k-path, see path 
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Linear Program for Vertex Cover of G, 
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[64] 
apex graph, 216 
apex-minor-free graph, 
arc, [577] 
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Baker’s technique, 
BFS 


breadth-first search, [212] 
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Boolean circuit, 
Boolean formula, 
t-normalized, 
antimonotone, |438 
monotone, 
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of a vertex set,|158 


bramble, 
branch decomposition, [228] 
branch set, [140] 


branching number, 
branching vector, 
branching walk, [325] 
branchwidth, [229] 
brute force, 








Catalan number, 
Catalan structure, 
Cauchy-Schwarz inequality, 1303 
center string, [68] 
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chromatic coding, [99] 
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circuit, [435] 
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input node, 
large nodes, 
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satisfying assignment, 
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cluster 

(p, q)-cluster, 
cluster graph, [40] [115] 
clustering, [248| [264] 
CNF formula, 580| 


color and conquer, 
color coding, [99] 




















composition, 
AND-, [533 
AND-cross-, [533] [534] [554] 
co] TS ea 


computation path, 


conjunctive normal form, 
connected set, 


connected vertex cover, 
connectivity function, |228} 
connectivity problem, 
contraction, see edge contraction 
contraction-closed problem, [207] 
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